Abstract This paper challenges some of the common assumptions underlying the mathematics used to
describe the physical world. We start by reviewing many of the assumptions underlying the concepts
of real, physical, rigid bodies and the translational and rotational properties of such rigid bodies.
Nearly all elementary and advanced texts make physical assumptions that are subtly different from
ours, and as a result we develop a mathematical description that is subtly different from the standard
mathematical structure.

Using the homogeneity and isotropy of space, we investigate the translational and rotational features
of rigid bodies in two and three dimensions. We find that the concept of rigid bodies and the concept
of the homogeneity of space are intrinsically linked. The geometric study of rotations of rigid objects
leads to a geometric product relationship for lines and vectors. By requiring this product to be both
associative and to satisfy Pythagoras' theorem, we obtain a choice of Clifford algebras.

We extend our arguments from space to include time. By assuming that $c\,\delta t = \delta l$ and rewriting
this in Lorentz invariant form as $c^2t^2 - x^2 - y^2 - z^2 = 0$, we obtain a generalization of Pythagoras to
spacetime. This leads us directly to establishing that the Clifford algebra Cℓ(1,3) is an appropriate
mathematical structure to describe spacetime.

Clifford algebras are not division algebras. We show that the existence of non-invertible elements
in the algebra is not a limitation of the usefulness to physics of the algebra but rather that it reflects
accurately the spacetime properties of physical systems.



Keywords Homogeneity • Isotropy • Rigid bodies • Geometry 



Philip H. Butler 

Department of Physics and Astronomy 

University of Canterbury, 

Private Bag 4800, Christchurch 8140, 

New Zealand 

E-mail: phil.butler@canterbury.ac.nz 

Niels G. Gresnigt 

Department of Physics and Astronomy 

University of Canterbury, 

Private Bag 4800, Christchurch 8140, 

New Zealand 

E-mail: niels.gresnigt@canterbury.ac.nz 

Peter F. Renaud 

Department of Mathematics and Statistics 

University of Canterbury, 

Private Bag 4800, Christchurch 8140, 

New Zealand 

E-mail: peter.renaud@canterbury.ac.nz



1 Preface 

In recent years three well-known theoretical physicists have written books challenging the
community to reconsider its focus on high-dimensional theories of fundamental physics, especially
string theory and its derivatives [1–3]. Each of these authors expresses frustration with the
progress of the past 40 years, and argues that there need to be changes to one or more of the
current understandings of special relativity, quantum mechanics, quantum field theory, the standard
model of particle physics and general relativity.
Penrose [3] ends his case (page 1045) with: 

[T]here are [many] deeply mysterious issues about which we have very little comprehension. It 
is quite likely that the 21st century will reveal even more wonderful insights than those we have 
been blessed with in the 20th. But for this to happen, we shall need powerful new ideas, which 
will take us in directions significantly different from those currently being pursued. Perhaps 
what we mainly need is some subtle change in perspective — something we have all missed... 

The aim of this paper is to review the basic assumptions made about physical space, in particular its 
geometry. From these assumptions we concentrate on developing the most appropriate mathematical 
framework within which to describe physical phenomena. For maximum clarity, we focus on everyday 
sized objects. We invite the reader to follow our arguments. We try to be upfront and clearly state all 
important assumptions. What we find is that the first changes that we wish to make to the physics, 
and to the mathematics we use to describe the physics, are changes at the geometric foundations. 

We introduce the concept of reference frames from the idea of rigid material objects, made of 
real atoms. One dimensional rigid rods are, for us, not an abstraction, but like three dimensional 
rigid bodies, an approximation. The real world is the place where we do measurements, and real 
measurements do not return exact answers. We endeavour to set up an idealised mathematical world 
that is a good model of the physical world. Our approach throughout is akin to the axiomatic approach 
typically found in an introductory mathematics text on vector algebra. 

In section 2 we set up the concept of a 'reference frame' and the concept of a 'straight line' by
taking the concept of a real, physical, rigid body in a 2-dimensional space and looking at translational 
properties. We find that the concept of a rigid body and the concept of homogeneity of space are linked. 
We then set up the mathematical concept of vectors as elements of a vector space over the field of 
rational numbers, Q. The operations of the vector space are linked to three separate operations on the 
points of rigid bodies: drawing lines between points; moving a rigid body with respect to another; and 
transforming from one reference frame to another. These mathematical and physical operations define 
for us the concept of straight lines used in the expression of Newton's First law. They are intrinsically 
derived from the property of space known as the 'homogeneity of space'. 

In section 3 we extend the ideas of section 2, using the isotropy of space to develop the
rotational properties of 2D space and to introduce the concept of a 'right angle'. By asking
for a product operation that describes rotations and matches Pythagoras' theorem
we are led from a vector space over Q to an algebra over Q. The algebra we derive is an example of a
Clifford algebra.

Section 4 extends the considerations of homogeneity and isotropy from two spatial dimensions to
three. The isotropy of 3D space and the rotation properties of rigid objects lead to a richer set of
properties and an eight dimensional Clifford algebra. We demonstrate that the maintenance of cyclic
structures of sets of basis lines and sets of basis planes, namely the parity conservation properties of
allowable physical movements of rigid bodies, requires the use of the Clifford algebra Cℓ(0,3) and not
the Clifford algebra Cℓ(3,0).

Section 5 studies time as a fourth dimension in a vector space over Q. The observation that the
speed of light measured with respect to any inertial rigid body is independent of where and in what
direction the light is traveling gives us a generalization of Pythagoras to spacetime. This directly
leads us to establish that the 16 dimensional Clifford algebra Cℓ(1,3) is an appropriate mathematical
structure to describe measurements of the motion of rigid bodies and of light in the reference frame
defined by a single rigid body. Our derivation requires us to assume that our rigid body frame is
inertial. Finally we show that all this implies that Cℓ(1,3) is the appropriate mathematics to describe
Lorentz and Poincaré transformations between rigid body reference frames and is also the appropriate
mathematics to describe translations, rotations, and boosts of rigid bodies.



Finally, section 6 looks at the algebraic structures of Clifford algebras. In particular we will discuss
the differences between various Clifford algebras and seek the matrix representations of them over the
reals, R, or its subfield, the rational numbers, Q.

2 Rigid Bodies to Reference Frames, and Homogeneity to Vectors 

The subject of physics deals with a huge range of scales, from well below the size of the proton,
$< 10^{-15}$ m, to the size of the universe, > 10 billion light years or some $10^{26}$ m. Even the scales of the objects
experienced by people in their daily lives range over some eight orders of magnitude, from fractions of 
a millimeter to tens of kilometers. Science has learnt ways to observe and measure objects from well 
below the scale of the proton to the scale of the universe. However in this paper we concentrate on 
understanding everyday sized objects, and developing the most appropriate mathematical framework 
within which to describe such objects. In general we expect the mathematical framework in which we 
work to be larger than our physical space, in the sense that not every mathematical construction or 
operation has a meaningful physical counterpart. However, we do want the converse to hold; every 
physically allowed operation can be represented in our mathematical framework. 

The place we shall begin is to seek to understand what is meant by the geometric content embedded 
in the usual statements of Newton's First Law, for example given by Serway and Jewett [4] as: 

In the absence of external forces, when viewed from an inertial reference frame, an object at 
rest remains at rest, and an object in motion continues in motion with a constant velocity (that 
is, with a constant speed in a straight line). 

It is worth emphasising that it has taken Serway and Jewett some 114 pages of preliminaries to get to 
that statement, not surprising as this quotation contains some dozen words that have a specific physics 
meaning. 

In this section we shall set up the concept of a 'reference frame' and the concept of a 'straight 
line' by taking the concept of a rigid body in a 2-dimensional 'toy world'. (A 'toy world' is one in 
which we can study certain processes in a simple way without being distracted by the full richness 
and complexity of the natural world in which we find ourselves.) The first set of rigid bodies we shall 
consider are a desktop, and a few transparent sheets of paper which we can move about on the desktop. 
In addition to the parameters to describe the position and orientation of the pieces of paper in the 
2-dimensional world of these material items, we will need another parameter to describe when the 
pieces of paper are in their different positions as we move them about. 

We then set up the mathematical concept of vectors as elements of a vector space, where the 
operations of the vector space are linked to three separate operations on the points of rigid bodies: 
drawing lines between points; moving a rigid body with respect to another; and transforming from one 
rigid body reference frame to another. These physical operations define for us the concept of straight 
lines used in the expression of Newton's First law. They are intrinsically derived from the property of 
space known as the 'homogeneity of space'. 

Much of the argument presented in this section forms part of those 114 pages of our physics text 
[4], prior to Newton's First law, although our approach is more akin to the axiomatic approach of an
introductory mathematics course on vector algebra than to an introductory physics course. 

We finish this section by comparing and contrasting our conclusions with those of standard
treatments (such as the introductory text above) and with the arguments in other recent research papers.

2.1 Points, lines and areas of a rigid body 

Let us define various physical idealizations, in particular points and lines, but starting from the concept 
of a 2D rigid body. There is a logical difficulty lurking here and we do not propose getting into a 
philosophers' discussion about evidence for, or the nature of, the 'objective reality' of philosophers. So 
we ignore the circularity issues that arise from our trying to describe a 'rigid object' before we know 
how to define 'rigid' or 'object'. The next subsection will address the first of the properties that allow 
us to test whether or not we have a 'rigid object'. 

Consider a 2D rigid object formed by a desktop. Mark a set of n 'points' A, B, C, . . . on the 
desktop. We may take these points to be special 2D (rigid) objects idealized as being of negligible or 



zero size in each of the two dimensions of our toy world. Now join these points up to form 'lines', see 
figure 1. Again, we need a workable concept of a line. Let us assume we have 'strings' or rigid rods 
that are a special kind of rigid object that we can idealize to be of finite length but of negligible or 
zero width. 

Fig. 1 A unique line exists between any two points on the desktop. The line AB is from point A to B. Given
n points in the space, the total number of possible lines is equal to $n^2$.

Fig. 2 The 'passive' operation of joining lines to lines consists of geometrically joining the head of one line to
the tail of another line.

There are $n^2$ possible lines AA, AB, AC, ..., BB, BA, BC, .... Some authors would call our lines
'directed line segments', but we have no need for the (non-physical) concept of lines of infinite length.
All our 'lines' are directed line segments (or 'points' if they are of zero length). The line AB is from A
to B, where we say that A is the 'tail' of AB and B the 'head' of AB. A line PQ for the case where
P = Q is a special case in that it has zero length and no direction. In a natural geometric sense we can
define the 'passive addition' of lines to lines on the desktop and give geometric meaning to expressions
such as:

AB + BC = AC (1)

(AB + BC) + CD = AB + (BC + CD) = AD (2)

This passive addition is just a matter of joining lines, head of the first to tail of the second, see figure 2.

Fig. 3 Active addition is the movement of a point from the tail to the head of a line. This figure shows both
active and passive translations. Active translations can be used to move points (A is moved to A' by the line
AA') or entire lines (the line AB can be moved to the parallel line A'B' using any one of a number of active
translations such as AA' and CC').

Likewise in a natural geometric sense we can define the passive addition of a line to a point and
give meanings to expressions such as:

A + AB = B (3)

(A + AB) + BC = C (4)



This addition is passive as there is no movement of the object, rather the point B may be considered 
as a relabeling of point A. 

It is important to note that passive addition does not contain any concept of translation or
equivalence. Therefore we are limited to adding lines where the head of the first line coincides with the tail
of the second line, such as AB and BC in figure 2. It makes no physical sense to add the lines AB
and CD of figure 1 together. Likewise we cannot add a line AB to a point P unless the tail of AB
coincides with P; that is A = P.
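
The restriction on passive addition can be made concrete with a small sketch. The Python below is only an illustration under stated assumptions: points are represented by their labels, a line by a (tail, head) pair of labels, and the names `Line`, `join_lines` and `relabel_point` are hypothetical helpers that do not appear in the paper.

```python
from typing import NamedTuple


class Line(NamedTuple):
    """A directed line segment between two labelled points on one rigid body."""
    tail: str
    head: str


def join_lines(first: Line, second: Line) -> Line:
    """Passive addition of lines, as in eq. (1): AB + BC = AC."""
    if first.head != second.tail:
        # No notion of translation is available, so the lines must already touch.
        raise ValueError("head of the first line must coincide with tail of the second")
    return Line(first.tail, second.head)


def relabel_point(point: str, line: Line) -> str:
    """Passive addition of a line to a point, as in eq. (3): A + AB = B."""
    if line.tail != point:
        raise ValueError("the tail of the line must coincide with the point")
    return line.head


# Illustrative lines between labelled points (labels are arbitrary).
AB, BC, CD = Line("A", "B"), Line("B", "C"), Line("C", "D")

AC = join_lines(AB, BC)                          # eq. (1)
AD_left = join_lines(join_lines(AB, BC), CD)     # (AB + BC) + CD
AD_right = join_lines(AB, join_lines(BC, CD))    # AB + (BC + CD)
C = relabel_point(relabel_point("A", AB), BC)    # eq. (4)
```

Calling `join_lines(AB, CD)` raises `ValueError`, which mirrors the statement that adding AB and CD passively makes no physical sense.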

Passive additions are of very limited use; much more useful are active additions, which we discuss
in the next subsection. By introducing the concepts of translation and equivalence, we can translate lines
through space while keeping their length and orientation the same. We can use active translations to move
points (A is moved to A' by the line AA') or entire lines (the line AB can be moved to the parallel line
A'B' using an active translation such as AA'); see figure 3 and the discussion later in this section.

For completeness, we note that three points A, B, C, define a triangular area. We return to this 
concept in more detail in the next section. 



2.2 Rigid body translations and the homogeneity of space 

By considering the motion of several rigid bodies we are led to the 'active' addition of points and lines, 
then to the concept of the 'homogeneity of space' associated with active motion. 

Consider having a 2D rigid transparent object, e.g. a sheet of transparent paper, which we can slide
about on the desktop. Mark the points A', B', C', ... on the paper directly above the corresponding
points on the desktop A, B, C, .... At this initial position we have A = A', B = B', C = C', ....
After a 'translation' of the paper, Trans(A → A'), the lines AA', BB', CC', ... are parallel to each
other, and of equal length. A translation is defined here to be a movement of a rigid object that
is compatible with the ordinary English meaning of translation, that is, 'movement in the absence of
rotation'. Mathematically, we say that the lines AA', BB', CC', ... are equivalent to each other. Any
line AA' on the desktop is equivalent to a whole class of parallel lines of the same length on the desktop.
We write [AA'] to denote this set of lines, called an equivalence class.

Alternatively we may use as our definition of translation the observation that, after the motion of
the paper relative to the desktop, the lines AA', BB', CC', ... are parallel and of equal length. The
translation can be equally well described by Trans(A → A'), or Trans(B → B'), or Trans(C → C'),
..., because the lines AA', BB', CC', ... belong to the same equivalence class. These properties can
be tested in our physical world by having a third rigid object, say another piece of paper, on which
we mark X at A and Y at A' and slide it about, without rotation, to compare the separation of the
other pairs of points, B and B', C and C', .... To compare the length of BB' to AA' we need only
translate this second piece of paper so that X is at B, whereby Y will be at B'; no rotation is required.

As an aside, we observe that the translational motion described above, that has the lines AA', BB',
CC', ... parallel, needs to be changed if we change from our flat desktop to a curved 2D surface, such
as the surface of the earth. When sliding objects around curved surfaces it is necessary to generalize
to a process known as 'parallel transport'.

The homogeneity of space is the name we give the above geometrical behaviour of rigid body
translational motion on a flat surface. The concept of a rigid body and the concept of the homogeneity
of space are linked. Both require the concept of fixed differences between points, which can be tested for
self consistency by our pieces of paper. In the rigid body that is the desktop, we can test the constancy
of the length of each one of the lines AB, AC, AD, ..., BC, BD, ... by repeatedly using our first
piece of paper on which we have marked the points A', B', C', .... We can do the same with each one
of the lines A'B', A'C', A'D', ... on the first piece of paper by matching the points A', B', C', ...
to points A'', B'', C'', ... on the second piece of paper. The combination of the rigidity of the objects
and the homogeneity of space requires that the lengths of the various lines on the various objects do
not change as the objects are moved relative to each other. Finally we can verify that the lines AA',
BB', CC', ... are all the one length and parallel to each other. We know experimentally if a surface
is curved by observing that at least some of the lines AA', BB', CC', ... have different lengths after
the translation Trans(A → A').

The existence of rigid bodies in a homogeneous space means that we can extend the passive and
active addition rules above, and expand the notation to use the equivalence, under translation, of the
various sets of lines. First note the equivalence of the lines XY (on the second piece of paper) and
AA', BB', CC', ... (between the points on the desktop and on the first piece of paper). Second, note
the equivalence of the lines on the desktop to the lines on the first piece of paper: the line AB is
equivalent to A'B', AC is equivalent to A'C', etc.

We can say that AA' moves the paper with respect to the desktop by the line AA', and write that
all points P on the desktop are moved by Trans(A → A') to the corresponding points P' on the paper:

Trans(A → A')(P) = P + AA' = P' (5)

Likewise for lines, we can use AA' to move any line BC on the desktop to its position B'C' on the
paper:

Trans(A → A')(BC) = BC + AA' = B'C' (6)

In this notation, which we will use henceforth, BC + AA' does not refer to the passive adding of lines 
in the sense discussed in the previous subsection, but rather to the active translation of BC by the line 
AA' . We note that AA' + BC is not equal to BC + AA' as they refer to different active translations. 
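
Equations (5) and (6), and the remark that AA' + BC differs from BC + AA', can be illustrated numerically. The sketch below assumes, purely for illustration, that points on the desktop carry rational Cartesian coordinates (a choice the text has not made at this stage); `translate_point` and `translate_line` are hypothetical helper names, and a line is stored as a (tail, head) pair of points.

```python
from fractions import Fraction as Q


def translate_point(p, mover):
    """Eq. (5): P + AA' = P'. `mover` is a line given as (tail, head)."""
    (tx, ty), (hx, hy) = mover
    return (p[0] + (hx - tx), p[1] + (hy - ty))


def translate_line(line, mover):
    """Eq. (6): BC + AA' = B'C'. Both ends of the line move by the same amount."""
    tail, head = line
    return (translate_point(tail, mover), translate_point(head, mover))


# Illustrative coordinates (assumed, not taken from the paper's figures).
A, A1 = (Q(0), Q(0)), (Q(3), Q(1))      # A and A'
B, C = (Q(1), Q(2)), (Q(2), Q(5))
AA1, BC = (A, A1), (B, C)

B1C1 = translate_line(BC, AA1)          # BC + AA': the line BC moved by AA'
moved_AA1 = translate_line(AA1, BC)     # AA' + BC: the line AA' moved by BC
back = translate_line(B1C1, (A1, A))    # moving by A'A undoes the move by AA'
```

Here `B1C1` and `moved_AA1` are different lines, which is the sense in which the two active additions are not equal, while `back` coincides with the original BC.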

Consider now marking on the desktop the points A', B', C', ... that are directly under the
corresponding points on the paper (in its moved position). We now have 2n points on the desktop. The
equations above may now be re-interpreted as actions on points on the desktop. In particular any line
PQ on the desktop may be added to (in the active sense) any other line AB on the desktop, or be
used to translate (in the active sense of movement) point A or line AB.

As a further subtlety, the action of translation comes in two physical senses. In the first sense we
have been considering moving either or both of our two sheets of paper by AA' while leaving the
desktop unmoved. In a second sense we can move the desktop by A'A while leaving the pieces of paper
unmoved. We have

Trans(A → A')(paper) = Trans(A → A')^{-1}(desktop) = Trans(A' → A)(desktop) (7)

The assumption of homogeneity of space says these two senses cannot be distinguished in the physical 
world. Relative motion is all that can be observed (or measured). 

2.3 Lines constructed by successive additions 

In preparation for deducing that we need a vector space over a field of numbers, we choose to find 
the smallest field satisfying simple assumptions about the measurement process. We find the field of 
rational numbers, Q, suffices, although it is usual to use the field of reals, R.
Starting from a line AB, we can form the lines 

2AB = AB + AB (8) 

3AB = 2AB + AB (9) 

giving a natural meaning for the symbol 'nAB'. (We define nAB to be the line on the rigid body that
starts at point A.) This notation incorporates the property of integers

nAB + mAB = (n)AB + (m)AB (10)

= (n + m)AB (11)

For negative integers, we start from the notion that BA moves the points and lines on any rigid 
body in the opposite direction to the line AB, so that it is natural to write 

-AB = BA (12) 

In general, 

(-n)AB = -(nAB) (13)

for integer n. Thus equation (11) applies to all integers small enough so that nAB, mAB and (n + m)AB
belong to the rigid body. Henceforth we consider only the cases where this condition is satisfied.

Now, let us use the notation ||AB|| for the length of AB, and use |n| for the absolute value of the
integer n. It is a property of lines on a rigid body that

||nAB|| = |n| ||AB|| (14)

Let us use these notations to compare lengths of parallel lines. (The next section sets up procedures
for comparing lengths of non-parallel lines.) Consider only lines that are parallel to the line AB and
begin by translating these to have the same tail A. Choose a line AX that is much shorter than AB
as our 'short measuring stick' in the AB direction. Translate AX end-to-end p times (where p is a
positive integer) until pAX reaches approximately the point B. (To be precise we say that pAX is
approximately at B if pAX is less than or equal to AB and (p + 1)AX is greater than AB.) Now
translate AX end-to-end q times (where q might be positive or negative) until qAX is approximately
at D', where AD' is a line CD, parallel to AB, after it has been translated to have its tail at A.
We conclude that:

||CD|| = |q/p| ||AB|| (15)

to the accuracy defined by the length of the chosen 'short measuring stick'.
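
The counting procedure behind equation (15) can be sketched in a few lines. The Python below is an illustration only: the "true" lengths are made-up rational values standing in for the physical rods, and `sticks_to_reach` is a hypothetical helper name. It assumes the stick is laid end-to-end until one more copy would overshoot the target.

```python
from fractions import Fraction
import math


def sticks_to_reach(signed_length: Fraction, stick: Fraction) -> int:
    """Number of whole sticks laid end-to-end without overshooting the target.

    For a positive target this is the p with p*stick <= length < (p+1)*stick;
    a target in the anti-parallel direction gives a negative count.
    """
    count = math.floor(abs(signed_length) / stick)
    return count if signed_length >= 0 else -count


# Assumed illustrative values, in arbitrary units.
stick = Fraction(1, 100)        # length of the short measuring stick AX
ab = Fraction(7304, 1000)       # length of AB
ad = Fraction(-2503, 1000)      # signed position of D' along the AB direction

p = sticks_to_reach(ab, stick)  # sticks needed to reach B
q = sticks_to_reach(ad, stick)  # sticks needed to reach D' (negative: anti-parallel)
ratio = Fraction(q, p)          # the rational number q/p
cd_estimate = abs(ratio) * ab   # eq. (15): ||CD|| = |q/p| ||AB||
```

For these values the estimate differs from the assumed length of CD by less than the length of the stick, in line with the stated accuracy.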

Extending our notation above to rational numbers, we may define the line rAB to be the line at
point A, parallel to AB, of length |r| ||AB||. If r is negative then rAB is sometimes said to be
anti-parallel to AB. Note that for each given line AB, we have imposed a physical limit to the value of r.
For a rigid body there is a lower limit $\ell_{\min}$ so that rAB is no smaller than the shortest measurements
of length (the shortest measurable line in the direction AB) on the rigid body, and an upper limit $\ell_{\max}$
so that rAB is no longer than the longest measurable length in the AB direction. (Aside: r is called
a rational number, not because it is sensible, but because it is a ratio of integers.)

Having the above definitions and procedures allows us to use any line (and not only a 'short 
measuring stick') as the measuring stick for its direction, but we need the concept of rotations (and 
the isotropy of space, taken here as the invariance of rigid bodies under rotations), to compare line 
lengths in non-parallel directions, see the next section. 



2.4 Discreteness and Continuity 

We have seen how the notion of rigid bodies and the homogeneity of physical space are closely related. 
We have not yet made any assumptions and statements regarding the continuous or discrete nature of 
physical space. 

Whether our field of numbers is chosen to be the reals or the rationals, there is an underlying 
assumption which can be expressed in a number of different ways, perhaps the clearest being that 
between any two numbers, we can find a third. Mathematically we say that the real number field and 
the rational number field are both dense. On the other hand, if space and time are quantized, this 
underlying assumption needs to be examined. 

An argument is presented by Isham [5] to show that the normal quantum mechanical framework 
together with the two assumptions; 

— physical space is homogeneous, 

— any spatial distance r can be divided in to two equal parts, r = r/2 + r/2, 

leads inevitably to the Heisenberg algebra. The authors of [6,7] have argued that the Heisenberg
algebra, in particular the commutator $[x_j, p_k]$, must be modified once gravitational effects associated
with the quantum measurement process are accounted for. The appropriate kinematical algebra for
this scenario is the Stabilised Poincaré Heisenberg algebra (SPHA for short) [8], which does feature a
modified Heisenberg algebra.

Any modification to the Heisenberg algebra necessarily induces an associated change in the
underlying geometry of physical space, with either the homogeneity or the continuity of space (with the
assumption that any spatial distance can be divided into two equal parts) being lost. The authors of
[9] have argued that it is the underlying homogeneity of space that is lost in this case and furthermore
that the induced inhomogeneities may serve as seeds for structure formation in an earlier epoch of the
universe (at the present epoch of the universe the modifications to the Heisenberg algebra are very
small and hence one would not expect to observe any inhomogeneities today).

In contrast, the authors of the present paper have on an earlier occasion shown that the Clifford
algebra Cℓ(1,3) generates the SPHA under the action of the familiar Lie bracket [10]. We show in this
paper that this Clifford algebra necessarily follows from the homogeneity and isotropy of physical space
together with Pythagoras' theorem (and the generalization to spacetime). In this derivation of Cℓ(1,3)
physical space is considered to be homogeneous; however, no assumption needs to be made about the
continuous or discrete nature of physical space. We therefore argue that the spacetime underlying the
SPHA is homogeneous and, since one of the two properties must be lost, not continuous. The usual
alternative is to argue that spacetime is discrete or quantized.

Perhaps the simplest formulation of a discrete spacetime is given by Meessen [11], who proposes the
following basic postulate of spacetime:

An ideally exact distance measurement along any direction in any inertial reference frame can 
only yield integer multiples of the same universally constant quantum of length a. 

In Meessen's formulation, spacetime is a lattice of points with minimum length and minimum time
scale and furthermore these minimum values are the same for all equivalent observers. In a discrete
spacetime a given spatial distance cannot always be divided into two equal parts. A direct consequence
of this is that there must exist some indivisible minimum unit of length. However this is not satisfactory
either. Other ways of quantizing spacetime have been considered, such as quantizing space and time
via a random 'sprinkling' of points onto a manifold as is done in causal set theory [12]. In such an
approach the distances between points vary.

It seems to us therefore, that to assume either spacetime is continuous or spacetime is discrete is 
unjustified. Some third alternative to quantizing spacetime is required. This issue appears to be a deep 
problem. Therefore, for the time being, rather than make an unjustified assumption we park the issue 
and avoid being distracted by it. 

2.5 Coordinate systems and choices of an origin 

Let us first consider the linear independence of translations and then let us derive the relationship 
between the geometric operations that we have been considering and the axioms of a vector space. 



Our statement of linear independence of lines in our 2D toy world of the desktop is as follows. If
two lines AB and CD are not parallel, then geometry says that given any two points P and Q (or any
line PQ) we can find unique numbers r and s so that

Q = P + rAB + sCD (16) 

where the equality is up to the sizes of the short measuring sticks in the AB and CD directions. The 
fact that we need precisely two non-parallel lines and the two numbers is why we say we are working 
in a 2D (two-dimensional) world. 

This expression can be rewritten in the form of three lines 

PQ = rAB + sCD (17) 

and we say that the three lines AB, CD, and PQ are linearly dependent. Conversely, two lines AB 
and CD are linearly independent if and only if they are non-parallel. 

Taking an arbitrary but fixed point O, which we shall call an origin, allows us to associate a unique
line OP for every point P on the desktop. Choosing two more points A and B, we may rewrite the
linear dependence equation, eq. (17), in terms of P, A and B, or the lines OA, OB and OP, as

OP = rOA + sOB

for any point P. With these choices we say that for origin O, the lines OA and OB are a 'basis' choice 
for the lines on the desktop, and (r, s) are the coordinates of P (or OP) in this basis. 
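
As a worked illustration of finding the coordinates (r, s), the sketch below solves OP = rOA + sOB for a chosen basis. It assumes, for illustration only, that each line from the origin is already described by a pair of rational components on some auxiliary grid; the function name `coordinates` and the numerical values are hypothetical. Exact rational arithmetic is used so that r and s stay in Q.

```python
from fractions import Fraction as Q


def coordinates(op, oa, ob):
    """Solve OP = r*OA + s*OB for (r, s) using Cramer's rule."""
    det = oa[0] * ob[1] - oa[1] * ob[0]
    if det == 0:
        # Parallel lines are linearly dependent and cannot serve as a basis.
        raise ValueError("OA and OB are parallel, so they do not form a basis")
    r = (op[0] * ob[1] - op[1] * ob[0]) / det
    s = (oa[0] * op[1] - oa[1] * op[0]) / det
    return r, s


# Assumed basis lines and points, for illustration only.
OA = (Q(2), Q(0))
OB = (Q(1), Q(3))

r1, s1 = coordinates((Q(4), Q(6)), OA, OB)   # OP = 1*OA + 2*OB
r2, s2 = coordinates((Q(1), Q(1)), OA, OB)   # OP = (1/3)*OA + (1/3)*OB
```

A non-parallel pair always gives a unique (r, s); a parallel pair is rejected, matching the statement that two lines are linearly independent if and only if they are non-parallel.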

2.6 Vectors and unit vectors 

Let us now carry out an abstraction process, where we construct an algebraic system called a vector 
space. The vector space replicates many of the addition properties of points and lines. A vector space 
V over a field F is an abstract mathematical construct that is defined by a set of axioms that describe 
the addition of the elements of the vector space (the 'vectors') and the product of vectors with the 
elements of the field (the 'scalars' or the 'numbers'). The key axioms are the Abelian properties and 
the associative properties of the sums of vectors and products of scalars with vectors. 

The axioms lead to the concept of linear independence, which in turn leads to the concept of the 
dimension of the space and the ability to choose a set of basis vectors. 

The usual vector space of freshman physics is obtained by defining a vector as the equivalence class 
[AB] of all lines parallel to AB and of the same length as AB. 

Defining 

a = [AB] (18) 

b = [CD] (19) 

c = [PQ] (20) 



means we can write the linear dependence equation, eq. (17), for our vectors above, as

c = ra + sb, where r, s ∈ Q (21)

and in particular the equivalence classes for the lines rAB and AB are related by 

[rAB] = r[AB] (22) 

Observe that there is always one line $OA_0$ in the equivalence class a = [AB] that has its tail at the
origin O of the coordinate system. The vector a can then be described by the point $A_0$. It is easy to
confuse, or in some instances conflate, the point $A_0$ on the rigid body with one or more of the lines
on the rigid body in the class [AB], or even with the abstract algebraic entity that is the vector a.
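
The distinction between a line, its equivalence class, and the representative with its tail at the origin can be sketched as follows. This is an illustration under assumed rational coordinates, with hypothetical helper names (`vector_of`, `scale_line`, `representative_at_origin`); the class [AB] is represented here by the displacement from tail to head, which is one convenient choice rather than the paper's construction.

```python
from fractions import Fraction as Q


def vector_of(line):
    """Represent the class [AB] by the displacement from tail to head."""
    (tx, ty), (hx, hy) = line
    return (hx - tx, hy - ty)


def scale_line(r, line):
    """The line rAB: same tail A, parallel to AB, with length |r| ||AB||."""
    (tx, ty), _ = line
    dx, dy = vector_of(line)
    return ((tx, ty), (tx + r * dx, ty + r * dy))


def representative_at_origin(line, origin=(Q(0), Q(0))):
    """The unique line in the class that has its tail at the origin O."""
    dx, dy = vector_of(line)
    return (origin, (origin[0] + dx, origin[1] + dy))


# Two different lines on the desktop that are parallel and of equal length.
AB = ((Q(1), Q(1)), (Q(3), Q(2)))
A1B1 = ((Q(4), Q(0)), (Q(6), Q(1)))

same_class = vector_of(AB) == vector_of(A1B1)       # both lie in [AB]
r = Q(3, 2)
scaled = vector_of(scale_line(r, AB))               # the class [rAB]
expected = tuple(r * c for c in vector_of(AB))      # r[AB], as in eq. (22)
OA0 = representative_at_origin(AB)                  # line from O to the point A_0
```

The lines `AB` and `A1B1` are distinct objects that share one vector, and `OA0` is the single member of the class whose head marks the point $A_0$.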

If the line AB is chosen as the measuring stick in the direction of AB, and we choose units for
length so that ||AB|| is 1 unit of length, then the vector a = [AB] is called the unit vector in the
direction of a. We shall usually label unit vectors with a 'hat' symbol, as in $\hat{\mathbf{a}}$.

In general, given vectors $\mathbf{a}, \mathbf{b}, \ldots$, we may choose unit vectors such that
$\mathbf{a} = a\hat{\mathbf{a}}$, $\mathbf{b} = b\hat{\mathbf{b}}$, .... As with lines, when we write
$\mathbf{a} = a\hat{\mathbf{a}}$, we say $a$ is the magnitude, or length, of $\mathbf{a}$ and we say that
$\hat{\mathbf{a}}$ is a unit vector in the direction of $\mathbf{a}$. The length $a$ is either zero or a
positive (rational) number.

Observe that vectors are 'mathematical objects', being elements of a vector space, V. Vectors have 
uses outside of geometry, and are often introduced in mathematics courses without any connection
to geometry. In this paper we have them firmly linked to geometric objects. Vectors can describe 
operations on lines (which are passive objects), or translations (which are active objects that describe 
the movement of rigid bodies, with their points, lines and areas). By choosing a point as the origin, 
a vector can describe a point. It is common to confuse these different, albeit linked, concepts: the 
mathematical entity that is a vector and that belongs to a vector space, and the physical entities of 
points, lines, and translations. It has long been known that many beginning physics and mathematics 
students can take a long time to grasp vector algebra because of this. If there are multiple meanings 
of the new words and new concepts, and this is not pointed out, confusion reigns in the students' 
minds. We aim to consistently use a notation that keeps the physical objects clearly separated from 
the mathematical objects. 

It is common to generalize the application of vector spaces from geometric space where vectors 
represent points, lines and translations to other physical objects, such as forces, velocities, accelerations. 
In typical physics notation, Newton's Second Law [4] is written as

$$\mathbf{F} = m\mathbf{a} \tag{23}$$

where $\mathbf{F}$ is a force of magnitude $F$ in direction $\hat{\mathbf{F}}$, and $\mathbf{a}$ is an
acceleration of magnitude $a$ in direction $\hat{\mathbf{a}}$. Thus in terms of magnitudes

$$F = ma \tag{24}$$

and in terms of directions

$$\hat{\mathbf{F}} = \hat{\mathbf{a}} \tag{25}$$

since both the unit vectors $\hat{\mathbf{F}}$ and $\hat{\mathbf{a}}$ are dimensionless in the sense of having no units (neither newton
nor metre/second/second). The only property that unit vectors have, in this formulation, is direction.
As a further example of the linking of the concepts and the typical abuse of notation, note that $\mathbf{a}$
can represent a displacement by a distance $a$ (of say 4.5 metre) in direction $\hat{\mathbf{a}}$ (of say 7.3 degrees north
of east). It is usual to say that $\hat{\mathbf{a}}$ is a unit vector, when $\hat{\mathbf{a}}$ is really a direction. We shall abuse notation
in this way to the extent that if $\mathbf{a}$ is of length 1, we write $\mathbf{a} = \hat{\mathbf{a}}$ rather than $\mathbf{a} = 1\hat{\mathbf{a}}$.



2.7 Vector Addition 

The addition of vectors follows simply from the correspondences set up above. To diagrammatically 
represent the addition of two vectors a and b we choose two appropriate lines (in the equivalence 
classes of the two vectors) to represent these vectors. The algebraic equation a + b = c can then be 
related to the geometric picture of joining the tail of the second line (representing b) to the head of 
the first line (representing a), giving a new line (representing the resultant vector c) from the tail of 
the first line to the head of the second. 

The commutativity of vector addition a + b = b + a corresponds diagrammatically to the parallelogram
law, see figure 4. We also have associativity (a + b) + c = a + (b + c), see figure 5.

2.8 Concluding remarks - What have we achieved? 

In this section we have seen that the observed translational features of rigid objects in the geometric 
space of the 2D physical world led to a set of operations on the points, lines and areas of rigid objects. 
They also led to the abstract construct that is the mathematical structure of a 2 dimensional vector 
space V over the rational number field Q. The vector space axioms are chosen so that the mathematical 
structure of the vector space matches the geometry of space, in particular its homogeneity. However 
the operations on rigid bodies are described by a subset of those rationals, limited by 

Fig. 4 By choosing appropriate lines to represent vectors, this figure demonstrates the commutativity of
addition of vectors. The vector resulting from adding the vector a to the vector b is the same as the vector
resulting from adding b to the vector a, so that a + b = b + a.




Fig. 5 This figure demonstrates the associativity of vector addition (a + b) + c = a + (b + c) 



Generalising to the full physical world, we can see no physical situation in which we need the
number ∞ nor lines whose length approaches zero by an infinite (or Cauchy) process. Points, lines and
areas are the 'observables' of our world of finite sized rigid bodies.

We remind the reader that we avoid making assumptions about the appropriate mathematics except 
when they have a firm basis in measurements and observations within the world under consideration. 
As a particular example we considered the issue of continuity conditions on the number system that we
need to use, and have concluded that we need only the rational numbers. Typically, continuity conditions
are assumed. However, various theories of spacetime assume a graininess to spacetime or a "quantum
foam". We will return to such issues later. For the present we ask the reader not to get distracted by
these issues and to explore the 2D (and the 3D) world as we find it.

Finally, we also remind the reader that although we insist on being able to represent every physically 
allowed operation within our mathematical framework, the converse does not hold and there are many 
mathematical operations which have no physically meaningful counterparts. 

In the next section we extend these ideas to take into consideration the rotational properties of
2D space. We are led from the vector space over Q to an algebra over Q. An algebra contains the 
vector space operations of multiplication of vectors by scalars, and the addition of vectors to vectors. 
It contains also the operation of multiplication of vectors by vectors. The algebra we derive is an 
example of a Clifford algebra. 



3 Isotropy plus Pythagoras gives a Clifford Algebra 

In this section we consider the rotational motion of 2D rigid bodies. This leads to a product operation
of vectors with vectors, giving rise to an algebra that corresponds closely to the isotropy of physical 2D
space and also to the rotational invariance of rigid bodies in this space.

There is a subtlety we have not mentioned: when studying homogeneity by means of translations
we talked of lines such as AB. Newton's first law talks of 'straight lines'. We drew our lines in our
toy world as straight lines, but homogeneity would seem to require that lines are merely of constant
curvature. However the rotation of the line AB by $\pi$ about its centre, $A + \tfrac{1}{2}AB$, enables experimental
verification that all intermediate points on line AB between A and B also lie on the rotated line BA.

The algebra we obtain in this section is four dimensional. We shall see in section [4] that an n-D
world naturally leads to an n-dimensional vector space to describe the homogeneity and translation
properties of the physical space, and to a $2^n$-dimensional algebra to describe its isotropy and rotation
properties.

We noted that the physical world did not satisfy all the axioms of the vector space, in particular 
the physical world is finite in extent, both in the very large and the very small. Here we maintain our 
approach to the assumptions underlying the basic laws of physics: we shall only make the assumptions 
we need to, and propose mathematical axioms that seem to be required - absence of evidence is not 
evidence of absence, nor a reason to make assumptions to simplify the mathematics. 

This section continues our study of our 2D toy world of finite extent (finite both in terms of how 
small and how large) to deduce some of the geometrical consequences of rotational invariance. The 
rotational invariance shown by all rigid bodies in the physical world is known as the 'isotropy of space'. 
The consequence of our contemplations is to find a natural way of comparing lengths of non-parallel 
lines and to extend the vector space to a Clifford algebra. 



3.1 Isotropy and rotational invariance of rigid objects 

Consider our toy world consisting of sheets of paper on the desktop. The most general motion of a sheet 
of paper relative to the desktop is described by giving the initial (A, B) and final (A' , B') positions 
on the desktop of two distinct points (A, B) of the paper. 

We say that we have a rotation about a point A if that point does not move, A = A'. If however
A ≠ A' the motion can be described either as a translation A to A' followed by a rotation about A',
or in some special cases simply as a rotation about some other fixed point. In general if we are given
the initial location of two (distinct) points, A and B, and the final location of those points, A' and
B', then we can describe the motion as a translation A to A' described by the line AA', followed by a
rotation about A' where the point B + AA' is rotated to B'. Since our world is of finite extent, there
are many cases where there is no fixed point. A pure translation is not a rotation about "the point at
infinity" as that point is not in our physical world, nor in our vector space.

For simplicity let us first consider rotations about a fixed point A, so that A = A'. It is easily 
confirmed in our toy world that two rotations of a sheet of paper about the same point are equivalent 
to a single rotation. There are several special cases of immediate interest: 

- The null rotation, 0 radian (or 0°), where A'B' = AB for all points B.

- The rotation through $2\pi$ (or 360°), where again A'B' = AB.

- The rotation through $\pi$ (or 180°), where A'B' = -AB. This rotation applied twice is equivalent to
the null rotation.

- The rotation through $\tfrac{1}{2}\pi$ (or 90°), where we say that A'B' is orthogonal to AB. This rotation
applied twice takes AB to -AB.

- The rotation through $\tfrac{3}{2}\pi$ (or 270°, or $-\tfrac{1}{2}\pi$, or -90°), where A'B' is again orthogonal to the line
AB and again two such rotations take AB to -AB.

We observe that when rotating objects in our toy world, it is a property of the space, and of rigid
bodies, that the rotation by any multiple of $2\pi$ is equivalent to the null rotation. A rotation through
angle $\theta$ has the same effect on the paper as a rotation through the angle $2\pi + \theta$.

The last two of the rotations in the list above, those through $\tfrac{1}{2}\pi$ or $\tfrac{3}{2}\pi$, are characterized by the
property that applying either of them twice gives a line AB'' that is parallel to BA (or equivalently
to -AB). This property is so important that lines at an angle of $\pi/2$ (or 90°) to each other may be
described in several ways in English: e.g. at right angles, normal, perpendicular, orthogonal.

In the above we have used two equivalent descriptions of a rotation of the sheet of paper, as an operation
on lines $\mathrm{ROT}(AB \to A'B')(\text{paper})$, or as an angle about a point $\mathrm{ROT}(\theta(AB, A'B') \text{ about } A)(\text{paper})$,
where $\theta(AB, A'B')$ is the anti-clockwise angle between the lines AB and A'B'. As with translations,
rotations of the paper in one direction are equivalent to rotations of the desktop in the opposite direction.
For example we have the equalities:

$$\begin{aligned}
\mathrm{ROT}(AB \to A'B')(\text{paper}) &= \mathrm{ROT}(\theta(AB, A'B') \text{ about } A)(\text{paper}) \\
&= \mathrm{ROT}(-\theta(A'B', AB) \text{ about } A)(\text{paper}) \\
&= \mathrm{ROT}(A'B' \to AB)(\text{desktop}) \\
&= \mathrm{ROT}(\theta(A'B', AB) \text{ about } A)(\text{desktop})
\end{aligned} \tag{26}$$

which all depend on the observation that space is isotropic. The 'isotropy of space' is the name we give 
for the property that a pair of rigid objects do not change their relationship, one to the other, when 
they are both rotated equal amounts. As with the 'homogeneity of space', isotropy is a property that 
requires the concepts of rigid objects (in our case, at least two pieces of paper and the desktop) and 
of motion relative to a reference rigid object (in our case, any one of the objects). 

We began this subsection by considering several cases of rotations by special angles, $\theta = 0, \tfrac{1}{2}\pi, \pi, \tfrac{3}{2}\pi, 2\pi$,
etc. In a manner similar to the definition of adding and dividing lengths in the previous section, we
can define rotations by angles that are rational fractions $r = p/q$ of $\pi$, where $r \in \mathbb{Q}$, such that

$$\mathrm{ROT}(r\pi, \text{ about } A)(AB) = AB' \quad \text{to the desired observable accuracy} \tag{27}$$

3.2 Unit measuring sticks and unit vectors 

Another property of our toy world is that any rotation, except those through an integer multiple of $\pi$,
takes AB into a line AB' that is linearly independent of AB. Both the 'short measuring stick', AX, and
the 'measuring stick' AB, of the previous section can be represented by pairs of points on the sheet 
of paper. Rotation of the paper from the direction of AB into the direction of another line CD allows 
the comparison of the length of two sticks in two directions of AB and CD. Using the translational 
and rotational invariance of our measuring sticks we can make comparisons of the lengths of all lines 
in the plane. We therefore conclude that, because of the homogeneity and isotropy of space, only one 
measuring stick is needed. 

A pair of orthogonal lines, XX' and YY', in our toy world gives rise to a pair of orthogonal vectors
$\mathbf{x} = [XX']$ and $\mathbf{y} = [YY']$ in our vector space. By choosing the measuring stick to be of unit length
(say 1 metre), we can choose the corresponding vectors to be of unit length. We write them as $\hat{\mathbf{x}}$ and
$\hat{\mathbf{y}}$. Pairs of orthogonal vectors of unit length are said to be orthonormal pairs.

Since our desktop world is 2D, any vectors $\mathbf{a}$ and $\mathbf{b}$ can be written in terms of the orthonormal
vectors $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$:

$$\mathbf{a} = a\hat{\mathbf{a}} = a_x\hat{\mathbf{x}} + a_y\hat{\mathbf{y}} \tag{28}$$

and

$$\mathbf{b} = b\hat{\mathbf{b}} = b_x\hat{\mathbf{x}} + b_y\hat{\mathbf{y}} \tag{29}$$

It is customary to say that the numbers $a_x$ and $a_y$ are the components of the vector $\mathbf{a}$ in the orthonormal
basis system $(\hat{\mathbf{x}}, \hat{\mathbf{y}})$.

The axioms of the vector space allow the addition operation to be written as

$$\mathbf{a} + \mathbf{b} = (a_x + b_x)\hat{\mathbf{x}} + (a_y + b_y)\hat{\mathbf{y}} \tag{30}$$

Each of these vector space equations may be carried across to corresponding operations on lines and
translations in the plane. A line AB may be written in terms of unit orthogonal lines XX' and YY' as

$$AB = a\,XX' + b\,YY' \tag{31}$$

and

$$A + AB = B = A + a\,XX' + b\,YY' \tag{32}$$



3.3 Multiplication of lines by lines and vectors by vectors 

We wish to have a geometric definition of the 'associative multiplication' or 'product' of one line, AB,
by another, CD, which we shall denote by the ordered pair (AB, CD). First translate the line CD so
that C moves to A. Thus C' = C + CA = A and D' = D + CA. Next translate the line CD so that C
moves to B. Thus C'' = C + CB = B and D'' = D + CB. Define the geometric entity associated with
the ordered pair (AB, CD) to be the parallelogram ABD'D'' as shown in figure 6.



Fig. 6 The 'product' (AB, CD) of the lines AB and CD is defined as the parallelogram formed by translating
the line CD first to AD' then second to BD'', and translating AB to D'D''.



Now define the 'multiplication' of one vector, a = [AA'], by another, b = [BB'], creating a 'bi-vector'
denoted by ab, as the equivalence class of all products (AA', BB') under appropriate equivalence
relations.

ab = [(AA', BB')]
   = the equivalence class of all line pairs (CC', DD') that
     are translationally and rotationally equivalent to (AA', BB') (33)

The first equivalence relation to use is the translational invariance inherited by the bi-vector from its
vectors

ab = [a, b], where a = [AA'] and b = [BB'] (34)

We also impose on ab associativity, bi-linearity over the field Q, and rotational invariance. The easiest
way to do this is to define ab in terms of its expression in orthonormal coordinates. (Joyce and Butler
[13] give a purely geometric argument.) Define ab as the term-wise associative expansion (the free
product [14]) of the components written in some orthonormal axis system.

$$\begin{aligned}
\mathbf{a}\mathbf{b} &= (a_x\hat{\mathbf{x}} + a_y\hat{\mathbf{y}})(b_x\hat{\mathbf{x}} + b_y\hat{\mathbf{y}}) \\
&= ab\,\hat{\mathbf{a}}\hat{\mathbf{b}} \\
&= a_x b_x\hat{\mathbf{x}}^2 + a_y b_y\hat{\mathbf{y}}^2 + a_x b_y\hat{\mathbf{x}}\hat{\mathbf{y}} + a_y b_x\hat{\mathbf{y}}\hat{\mathbf{x}}
\end{aligned} \tag{35}$$

where we have used the property that the components, $a_x$, $a_y$, $b_x$, $b_y$, being numbers, commute with
the unit vectors. However the product of the vectors is not commutative, as we shall explore in detail
in the following.

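We now ask that the product incorporate the Euclidean metric and in particular ask that Pythagoras'
theorem holds. For the product of vector $\mathbf{a}$ with itself, we have

$$\begin{aligned}
\mathbf{a}\mathbf{a} &= \mathbf{a}^2 \\
&= a^2\hat{\mathbf{a}}^2 \\
&= a_x^2\hat{\mathbf{x}}^2 + a_y^2\hat{\mathbf{y}}^2 + a_x a_y(\hat{\mathbf{x}}\hat{\mathbf{y}} + \hat{\mathbf{y}}\hat{\mathbf{x}})
\end{aligned} \tag{36}$$

If this is to satisfy Pythagoras,

$$a^2 = a_x^2 + a_y^2 \tag{37}$$

then we must have

$$\hat{\mathbf{a}}^2 = \hat{\mathbf{x}}^2 = \hat{\mathbf{y}}^2 \tag{38}$$

and

$$\hat{\mathbf{x}}\hat{\mathbf{y}} + \hat{\mathbf{y}}\hat{\mathbf{x}} = 0 \tag{39}$$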

We want the smallest algebra that contains both $\hat{\mathbf{x}}$ and $\hat{\mathbf{x}}\hat{\mathbf{y}}$ (and thus also $\hat{\mathbf{y}}$, $\hat{\mathbf{a}}$ and $\hat{\mathbf{y}}\hat{\mathbf{x}}$).

We first choose $\hat{\mathbf{a}}^2$ to be the rational number $\eta$. In the previous subsection we chose unit vectors to
have equal length, which we declared to be the length unit, or 'standard measuring stick'. The link
between unit vectors and the standard measuring stick can be rescaled by any number in our field Q.
However that number appears as a square in eq(38), so we have two independent cases for $\eta$, depending
on whether $\hat{\mathbf{a}}^2$ is positive ($\eta = +1$) or negative ($\eta = -1$).

The number $\eta$ is known as the metric of the space. The choice of $\eta = -1$ gives $\mathbf{a}^2 < 0$ for all $\mathbf{a}$. We
shall call this choice the 'anti-Euclidean metric'. In the next section we compare and contrast the two
possible choices of metric, $\eta = \pm 1$.

The pair of equations, eq(38) and eq(39), define the Clifford algebras $C\ell(2,0)$ and $C\ell(0,2)$ as
$\eta = \pm 1$.



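The second equality, eq(39), introduces a fourth basis element $\hat{\mathbf{k}}$ (beyond 1, $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$)

$$\hat{\mathbf{k}} = \hat{\mathbf{x}}\hat{\mathbf{y}} = -\hat{\mathbf{y}}\hat{\mathbf{x}} \tag{40}$$

into the algebra. We have created an associative algebra of the four basis elements 1, $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$ and $\hat{\mathbf{k}}$ where

$$\begin{aligned}
\hat{\mathbf{k}}^2 &= (\hat{\mathbf{x}}\hat{\mathbf{y}})(\hat{\mathbf{x}}\hat{\mathbf{y}}) \\
&= -\hat{\mathbf{x}}(\hat{\mathbf{y}}\hat{\mathbf{y}})\hat{\mathbf{x}} \\
&= -\eta\,\hat{\mathbf{x}}\hat{\mathbf{x}} \\
&= -1
\end{aligned} \tag{41}$$

for both choices for $\eta$.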
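
The relations eq(38) to eq(41) can be checked mechanically. The following is a minimal sketch, not the authors' implementation: it assumes a multivector is stored as a 4-tuple of coefficients (scalar, x, y, k), and the names `gp` and `check_relations` are hypothetical helpers introduced only for this illustration.

```python
def gp(A, B, eta):
    """Product of A = (s, x, y, k) and B, using x*x = y*y = eta and k = x*y = -y*x."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,   # scalar part
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,         # x part
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,         # y part
        a0*b3 + a3*b0 + a1*b2 - a2*b1,                 # k part
    )

ONE, X, Y, K = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)

def check_relations(eta):
    """Return True if eq(38) to eq(41) hold for the given metric eta."""
    neg = lambda A: tuple(-c for c in A)
    return (
        gp(X, X, eta) == gp(Y, Y, eta) == (eta, 0, 0, 0)   # eq(38)
        and gp(X, Y, eta) == K                              # eq(40)
        and gp(Y, X, eta) == neg(K)                         # eq(39) and eq(40)
        and gp(K, K, eta) == neg(ONE)                       # eq(41)
    )

for eta in (+1, -1):
    print(eta, check_relations(eta))
```

The coefficient table inside `gp` is just the bilinear extension of the basis products, so the same function serves for both choices of metric.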

We shall see that $\hat{\mathbf{k}}$ is the algebraic unit that describes a unit area in the xy-plane; it is not the
normal to the plane, since such a normal does not exist in our 2D geometry. Instead, just as the basis
vector $\hat{\mathbf{x}}$ is the direction of the x-axis and is dimensionless, so the basis bi-vector $\hat{\mathbf{k}}$ is what we may call
the 'direction' of the xy-plane. Being the product of two dimensionless quantities, $\hat{\mathbf{k}}$ is dimensionless
and, as we shall see, is associated with the angle $\tfrac{1}{2}\pi$ radian.

The four objects $(1, \hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{k}})$ together with their negatives $(-1, -\hat{\mathbf{x}}, -\hat{\mathbf{y}}, -\hat{\mathbf{k}})$ form the eight element
Clifford groups, $C\ell^{\mathrm{group}}(2,0)$ or $C\ell^{\mathrm{group}}(0,2)$, associated with the Clifford algebras $C\ell(2,0)$ or $C\ell(0,2)$
as $\eta = \pm 1$. The group combination law is the associative product defined above in eq(35). The same
four objects are also the basis for the four dimensional vector space $\mathbb{Q}^4$ over our field, Q, using the
addition operation, with the arbitrary element written

$$A = a + b\hat{\mathbf{x}} + c\hat{\mathbf{y}} + d\hat{\mathbf{k}}, \quad a, b, c, d \in \mathbb{Q} \tag{42}$$

An algebra is that mathematical structure that has both the addition and scalar multiplication 
operations of a vector space, and also the associative multiplication operation of a group. In our 
case the general elements of the algebra are linear combinations of arbitrary scalars a, vectors a, and 
bi-vectors ab. In the above we have derived a four dimensional algebra that is firmly based on the 
homogeneity and isotropy of our 2D physical toy world, being sheets of rigid paper on the rigid desktop. 
Let us now explore the vector product, ab. 

3.4 The geometric information in the vector product 

The vector product ab represents both the angle between the lines that correspond to a and b, and
segments of the plane (parallelograms) spanned by the lines that correspond to a and b. It has lost the
information about the absolute lengths of a and b, as can be seen using the bi-linearity of the vector
product

$$\mathbf{a}\mathbf{b} = \left(\tfrac{1}{r}\mathbf{a}\right)(r\mathbf{b}) \tag{43}$$

We remind ourselves that vectors have, in a similar sense, lost the information about the positions of
lines; vectors have only length and direction.

The vector a represents any vector in the translational equivalence class a = [AA'] and similarly
for b = [BB']. So the equivalence class ab = [([AA'], [BB'])] knows neither the start of the lines, nor
their length, only the product of their lengths. We shall also see that it knows only the difference in
the directions of the lines: ab is invariant under rotations in the plane of the desktop.



In the next subsection we shall prove (see eq(55)) the following. Consider the pair of lines AA' and
BB' and form the product (AA', BB') with corresponding bi-vector ab. Take two other lines CC' and
DD' in the desktop, to give the product (CC', DD') and corresponding bi-vector cd. Then this second
product belongs to the same equivalence class of products as (AA', BB'), that is bi-vector cd equals
bi-vector ab, if and only if both ||AA'|| ||BB'|| = ||CC'|| ||DD'||, and the angle θ(AA' to BB') equals
the angle θ(CC' to DD').

Just as the vector a represents an equivalence class of lines (passive geometric objects) and also
represents an equivalence class of translations (active geometric objects), we shall see that the vector
product ab represents an equivalence class of line pairs (passive geometric objects), and also an
equivalence class of rotations (active geometric objects).

3.5 Rotations using bi-vectors 

The choice of $\eta = -1$ leads to counter-clockwise rotations in what follows, while the choice of $\eta = +1$
leads to clockwise rotations. Often the same effect can be obtained by writing the operator on the
right instead of the left. For the remainder of the paper, except where we state otherwise, we choose
the value¹

$$\eta = -1 \tag{44}$$

because handedness and parity-conservation arguments in section [4] show that this choice is appropriate
for the geometry of the rigid objects of the Universe.

¹ The reader is encouraged to work through the equations of the remainder of this section using $\eta$ as a
variable or with $\eta = +1$.

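With this choice of $\eta$, we may calculate that the basis bi-vector $\hat{\mathbf{k}}$, when used as an operator acting
on the left, rotates $\hat{\mathbf{x}}$ into $\hat{\mathbf{y}}$, and $\hat{\mathbf{y}}$ into $-\hat{\mathbf{x}}$, as follows:

$$\begin{aligned}
\hat{\mathbf{k}}\hat{\mathbf{x}} &= (\hat{\mathbf{x}}\hat{\mathbf{y}})\hat{\mathbf{x}} \\
&= \hat{\mathbf{x}}(\hat{\mathbf{y}}\hat{\mathbf{x}}) \\
&= -\hat{\mathbf{x}}(\hat{\mathbf{x}}\hat{\mathbf{y}}) \\
&= -(\hat{\mathbf{x}}\hat{\mathbf{x}})\hat{\mathbf{y}} \\
&= -\eta\hat{\mathbf{y}} \\
&= \hat{\mathbf{y}}
\end{aligned} \tag{45}$$

and

$$\begin{aligned}
\hat{\mathbf{k}}\hat{\mathbf{y}} &= (\hat{\mathbf{x}}\hat{\mathbf{y}})\hat{\mathbf{y}} \\
&= \hat{\mathbf{x}}(\hat{\mathbf{y}}\hat{\mathbf{y}}) \\
&= \eta\hat{\mathbf{x}} \\
&= -\hat{\mathbf{x}}
\end{aligned} \tag{46}$$

Thus on taking the two vectors $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$ as an ordered pair, $(\hat{\mathbf{x}}, \hat{\mathbf{y}})$, $\hat{\mathbf{k}}$ is the $\tfrac{1}{2}\pi$ rotation of this pair in
the positive sense

$$\begin{aligned}
\mathrm{ROT}(\hat{\mathbf{k}})(\hat{\mathbf{x}}, \hat{\mathbf{y}}) &= \hat{\mathbf{k}}(\hat{\mathbf{x}}, \hat{\mathbf{y}}) \\
&= (\hat{\mathbf{y}}, -\hat{\mathbf{x}}) \\
&= \mathrm{ROT}(\tfrac{1}{2}\pi)(\hat{\mathbf{x}}, \hat{\mathbf{y}})
\end{aligned} \tag{47}$$

and $-\hat{\mathbf{k}}$ is the $\tfrac{3}{2}\pi$ rotation in the positive sense, the $\tfrac{1}{2}\pi$ rotation in the negative sense, or the $\hat{\mathbf{k}}$ operator
acting on the right

$$\begin{aligned}
\mathrm{ROT}(-\hat{\mathbf{k}})(\hat{\mathbf{x}}, \hat{\mathbf{y}}) &= -\hat{\mathbf{k}}(\hat{\mathbf{x}}, \hat{\mathbf{y}}) \\
&= (\hat{\mathbf{x}}, \hat{\mathbf{y}})\hat{\mathbf{k}} \\
&= (-\hat{\mathbf{y}}, \hat{\mathbf{x}}) \\
&= \mathrm{ROT}(-\tfrac{1}{2}\pi)(\hat{\mathbf{x}}, \hat{\mathbf{y}})
\end{aligned} \tag{48}$$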

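Since $\hat{\mathbf{k}}^2 = -1$, De Moivre's theorem may be used to write the exponential function $\exp(\theta\hat{\mathbf{k}})$ as the
sum of sine and cosine terms. For any object such as $\hat{\mathbf{k}}$ that squares to $-1$ we have

$$\exp(\hat{\mathbf{k}}\theta) = e^{\hat{\mathbf{k}}\theta} = \cos\theta + \hat{\mathbf{k}}\sin\theta \tag{49}$$

This result is a generalisation of the result for the complex numbers, where $i^2 = -1$ and
$\exp(i\phi) = \cos\phi + i\sin\phi$.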
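
Eq(49) can be illustrated numerically by summing the exponential power series with the Clifford product. The sketch below is an illustration under stated assumptions: it uses floating-point numbers rather than the rationals of the text, stores a multivector as the coefficient tuple (scalar, x, y, k), and the helper names `gp` and `exp_series` are hypothetical.

```python
import math

def gp(A, B, eta=-1):
    """Clifford product of (s, x, y, k) tuples with x*x = y*y = eta and k = x*y."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,
        a0*b3 + a3*b0 + a1*b2 - a2*b1,
    )

def exp_series(A, terms=30, eta=-1):
    """Sum the power series 1 + A + A^2/2! + ... using the Clifford product."""
    total = (1.0, 0.0, 0.0, 0.0)
    term = (1.0, 0.0, 0.0, 0.0)
    for n in range(1, terms):
        term = tuple(c / n for c in gp(term, A, eta))     # A^n / n!
        total = tuple(t + c for t, c in zip(total, term))
    return total

theta = 0.7                                               # arbitrary sample angle
series = exp_series((0.0, 0.0, 0.0, theta))               # exp(k * theta)
closed = (math.cos(theta), 0.0, 0.0, math.sin(theta))     # cos(theta) + k sin(theta)
print(series)
print(closed)
```

The two printed tuples agree to rounding error; the same holds with `eta=+1`, since the bi-vector squares to -1 for either metric.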

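We may transform the expression for $\mathbf{a}$ in orthonormal, Galilean coordinates (x, y), eq(28),

$$\mathbf{a} = a_x\hat{\mathbf{x}} + a_y\hat{\mathbf{y}}$$

into circular polar coordinates $(r, \theta)$, where $r = a$, $a_x = a\cos\theta$ and $a_y = a\sin\theta$,

$$\mathbf{a} = a\cos\theta\,\hat{\mathbf{x}} + a\sin\theta\,\hat{\mathbf{y}} \tag{50}$$

and so, using eq(49) and $\hat{\mathbf{k}}\hat{\mathbf{x}} = \hat{\mathbf{y}}$,

$$\mathbf{a} = a\exp(\hat{\mathbf{k}}\theta)\,\hat{\mathbf{x}} = a e^{\hat{\mathbf{k}}\theta}\hat{\mathbf{x}} \tag{51}$$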
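
The operator $\exp(\hat{\mathbf{k}}\phi)$ of eq(49) rotates vector $\mathbf{a}$, when operating on the left, by angle $\phi$:

$$e^{\hat{\mathbf{k}}\phi}\mathbf{a} = e^{\hat{\mathbf{k}}\phi}\,a e^{\hat{\mathbf{k}}\theta}\hat{\mathbf{x}} = a e^{\hat{\mathbf{k}}(\phi + \theta)}\hat{\mathbf{x}} \tag{52}$$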

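However it does not behave this way acting on scalars, $r \in \mathbb{Q}$, or on itself. Thus to write a formula for
multi-vectors (scalars, vectors and bi-vectors), we require a different form for the operator. This form
is a two-sided operation: if $A$ is an arbitrary element of $C\ell(0,2)$, as in eq(42), then

$$\mathrm{ROT}(\text{by } \phi \text{ in the } \hat{\mathbf{k}} \text{ plane})(A) = e^{\hat{\mathbf{k}}\phi/2} A\, e^{-\hat{\mathbf{k}}\phi/2} \tag{53}$$

because $\hat{\mathbf{k}}$ commutes with scalars and itself, and anti-commutes with the mono-vectors $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$. As we
have seen, scalars (that is, numbers) and the bi-vector $\hat{\mathbf{k}}$ are unchanged by rotations in the xy-plane,
so $a + d\hat{\mathbf{k}} \to a + d\hat{\mathbf{k}}$, while the vector part of $A$, $b\hat{\mathbf{x}} + c\hat{\mathbf{y}}$, is rotated correctly

$$\begin{aligned}
e^{\hat{\mathbf{k}}\phi/2} A\, e^{-\hat{\mathbf{k}}\phi/2} &= e^{\hat{\mathbf{k}}\phi/2}(a + b\hat{\mathbf{x}} + c\hat{\mathbf{y}} + d\hat{\mathbf{k}})e^{-\hat{\mathbf{k}}\phi/2} \\
&= e^{\hat{\mathbf{k}}\phi/2}e^{-\hat{\mathbf{k}}\phi/2}(a + d\hat{\mathbf{k}}) + e^{\hat{\mathbf{k}}\phi/2}e^{\hat{\mathbf{k}}\phi/2}(b\hat{\mathbf{x}} + c\hat{\mathbf{y}}) \\
&= (a + d\hat{\mathbf{k}}) + e^{\hat{\mathbf{k}}\phi}(b\hat{\mathbf{x}} + c\hat{\mathbf{y}})
\end{aligned} \tag{54}$$

where we have used the fact that $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$ anticommute with $\hat{\mathbf{k}}$.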
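
The two-sided form of eq(53) can be sketched numerically as follows. This is an illustration only: it assumes the metric choice $\eta = -1$ of eq(44), uses floating-point angles rather than rational multiples of $\pi$, and the names `gp`, `exp_k` and `rotate` are hypothetical helpers acting on coefficient tuples (scalar, x, y, k).

```python
import math

def gp(A, B, eta=-1):
    """Clifford product of (s, x, y, k) tuples with x*x = y*y = eta and k = x*y."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,
        a0*b3 + a3*b0 + a1*b2 - a2*b1,
    )

def exp_k(phi):
    """exp(k*phi) = cos(phi) + k sin(phi), as in eq(49)."""
    return (math.cos(phi), 0.0, 0.0, math.sin(phi))

def rotate(A, phi):
    """Two-sided rotation exp(k*phi/2) A exp(-k*phi/2) of a general multivector A."""
    return gp(gp(exp_k(phi / 2), A), exp_k(-phi / 2))

# Sample multivector A = 2 + x + 3k, rotated through a right angle.
A = (2.0, 1.0, 0.0, 3.0)
print(rotate(A, math.pi / 2))
```

For this sample the scalar and bi-vector coefficients are left unchanged, while the vector part x is carried to y, up to rounding error.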



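The general bi-vector $\mathbf{a}\mathbf{b}$, eq(35), can be written in terms of scalar and pure bi-vector terms as
follows

$$\begin{aligned}
\mathbf{a} &= a e^{\hat{\mathbf{k}}\theta_a}\hat{\mathbf{x}} \\
\mathbf{b} &= b e^{\hat{\mathbf{k}}\theta_b}\hat{\mathbf{x}} \\
\mathbf{a}\mathbf{b} &= ab\, e^{\hat{\mathbf{k}}\theta_a}\hat{\mathbf{x}}\, e^{\hat{\mathbf{k}}\theta_b}\hat{\mathbf{x}} \\
&= -ab\, e^{\hat{\mathbf{k}}(\theta_a - \theta_b)} \\
&= -ab\cos\theta_{ab} + ab\,\hat{\mathbf{k}}\sin\theta_{ab}
\end{aligned} \tag{55}$$

showing that $\mathbf{a}\mathbf{b}$ depends only on the product $ab$ of the lengths of $\mathbf{a}$ and $\mathbf{b}$, and the angle between
them, $\theta_{ab} = \theta_b - \theta_a$. This proves the result stated at the end of subsection 3.4 above. In subsection 3.6
below, we shall obtain a simple expression for the bi-vector for half the angle between lines a and b, as
it is needed for rotating general multi-vectors $A$, as in eq(53).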

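In many situations the coordinate free representation of the above results is powerful. Recall that
$\mathbf{a}\mathbf{b}$ is the equivalence class of all line pairs [(AA', BB')], where a = [AA'] and b = [BB']. In general
we have that the vector $\mathbf{r}$ is rotated through the angle from $\mathbf{a}$ to $\mathbf{b}$, into the vector $\mathbf{r}'$, by multiplying
on the right by $\mathbf{a}\mathbf{b}/(\eta ab)$ or on the left by $\mathbf{b}\mathbf{a}/(\eta ab)$ as follows

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(\mathbf{r}) &= \mathbf{r}' \\
&= \mathbf{r}\,\mathbf{a}\mathbf{b}/(\eta ab) \\
&= \mathbf{b}\mathbf{a}\,\mathbf{r}/(\eta ab)
\end{aligned} \tag{56}$$

(A metric free version of the above is obtained if $\mathbf{a}$ and $\mathbf{b}$ are of the same length, $a = b$, when the
rotation operator is simply $\mathbf{a}\mathbf{b}/\mathbf{a}^2$.)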

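Because the algebra is associative, this rotation operator has a trivial action on a vector $\mathbf{a}$, acting
from the right

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(\mathbf{a}) &= \mathbf{a}\,\mathbf{a}\mathbf{b}/(\eta ab) \\
&= \eta a^2\,\mathbf{b}/(\eta ab) \\
&= \frac{a}{b}\,\mathbf{b}
\end{aligned} \tag{57}$$

which is a line of length $a$ in the direction of $\mathbf{b}$. The corresponding results hold for multiplication on
the left

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(\mathbf{a}) &= \mathbf{b}\,\mathbf{a}\mathbf{a}/(\eta ab) \\
&= \eta a^2\,\mathbf{b}/(\eta ab)
\end{aligned} \tag{58}$$

We note that $\mathbf{b}\mathbf{a}$ is the inverse rotation (the rotation in the opposite sense) to $\mathbf{a}\mathbf{b}$, as it rotates $\mathbf{b}$ into
$\mathbf{a}$: $\mathrm{ROT}(\mathbf{a} \to \mathbf{b}) = \mathrm{ROT}(\mathbf{b} \to \mathbf{a})^{-1}$.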
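
The metric-free form of eq(56) can be tried out with exact rational arithmetic. The sketch below is illustrative only: the sample vectors a = (3, 4) and b = (5, 0) are chosen because both have length 5, and the helper names `gp`, `vec`, `rot_right` and `rot_left` are hypothetical. Multivectors are stored as coefficient tuples (scalar, x, y, k).

```python
from fractions import Fraction

def gp(A, B, eta=-1):
    """Clifford product of (s, x, y, k) tuples with x*x = y*y = eta and k = x*y."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,
        a0*b3 + a3*b0 + a1*b2 - a2*b1,
    )

def vec(x, y):
    """A pure vector x*x_hat + y*y_hat with rational components."""
    return (Fraction(0), Fraction(x), Fraction(y), Fraction(0))

def rot_right(r, a, b, eta=-1):
    """r' = r (ab) / a^2: rotation a -> b acting from the right, for |a| = |b|."""
    a_sq = gp(a, a, eta)[0]            # vector square, the scalar eta * |a|^2
    return tuple(c / a_sq for c in gp(r, gp(a, b, eta), eta))

def rot_left(r, a, b, eta=-1):
    """r' = (ba) r / a^2: the same rotation written as a left multiplication."""
    a_sq = gp(a, a, eta)[0]
    return tuple(c / a_sq for c in gp(gp(b, a, eta), r, eta))

a, b = vec(3, 4), vec(5, 0)            # sample vectors of equal length 5
print(rot_right(a, a, b))              # a itself is carried to b
print(rot_right(vec(0, 5), a, b))      # another vector rotated through the same angle
print(rot_left(vec(0, 5), a, b))
```

No square roots or trigonometric functions are needed, so every intermediate quantity stays in Q.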



3.6 Half angle rotations 

Figure 7 shows that the rotation $\mathrm{ROT}(\mathbf{a} \to \mathbf{b})$ may be composed as the product of two rotations, first
$\mathrm{ROT}(\mathbf{a} \to \mathbf{a} + \mathbf{b})$, and then $\mathrm{ROT}(\mathbf{a} + \mathbf{b} \to \mathbf{b})$.

$$\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(A) = \mathrm{ROT}(\mathbf{a} + \mathbf{b} \to \mathbf{b})\left(\mathrm{ROT}(\mathbf{a} \to \mathbf{a} + \mathbf{b})(A)\right) \tag{59}$$

If $\mathbf{a}$ and $\mathbf{b}$ are of equal length, $a = b$, then the two rotations are through equal angles.

Fig. 7 The rotation a → b may be composed as the product of two rotations, from a to the diagonal (a + b)
and then to b.



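If $\mathbf{a}$ and $\mathbf{b}$ are not of equal length, and we wish the steps to be equal, then we need to rescale. We
could choose $\mathbf{a}' = \hat{\mathbf{a}}$ and $\mathbf{b}' = \hat{\mathbf{b}}$, but rather than using unit vectors, let us keep it somewhat more
general and define $\mathbf{c}$ as

$$\mathbf{c} = b\,\mathbf{a} + a\,\mathbf{b} \tag{60}$$

The rotation operators for $\mathbf{a}$ to $\mathbf{c}$, and for $\mathbf{c}$ to $\mathbf{b}$ are equal, and are therefore both equal to the square
root of the rotation operator for $\mathbf{a} \to \mathbf{b}$.

$$\mathrm{ROT}(\mathbf{a} \to \mathbf{c}) = \mathrm{ROT}(\mathbf{c} \to \mathbf{b}) = \mathrm{ROT}(\mathbf{a} \to \mathbf{b})^{1/2} \tag{61}$$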
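
These rotations can be written in terms of their action on a vector $\mathbf{r}$ as

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{c})(\mathbf{r}) &= \mathbf{r}\,\mathbf{a}\mathbf{c}/(\eta ac) \\
= \mathrm{ROT}(\mathbf{c} \to \mathbf{b})(\mathbf{r}) &= \mathbf{r}\,\mathbf{c}\mathbf{b}/(\eta bc) \\
= \mathrm{ROT}(\mathbf{a} \to \mathbf{b})^{1/2}(\mathbf{r}) &= \mathbf{r}\left(\mathbf{a}\mathbf{b}/(\eta ab)\right)^{1/2} \\
&= \mathbf{r}\left(\mathbf{b}\mathbf{a}/(\eta ab)\right)^{-1/2}
\end{aligned} \tag{62}$$

where the final equality comes from the result that $\mathrm{ROT}(\mathbf{a} \to \mathbf{b}) = \mathrm{ROT}(\mathbf{b} \to \mathbf{a})^{-1}$.
The rotation of the general multi-vector $A$ is thus given by

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(A) &= \left(\mathbf{a}\mathbf{b}/(\eta ab)\right)^{-1/2} A \left(\mathbf{a}\mathbf{b}/(\eta ab)\right)^{1/2} \\
&= \left(\mathbf{c}\mathbf{a}/(\eta ac)\right) A \left(\mathbf{a}\mathbf{c}/(\eta ac)\right) \\
&= \mathbf{c}\mathbf{a}\,A\,\mathbf{a}\mathbf{c}/(a^2c^2)
\end{aligned} \tag{63}$$

The explicit appearance of the lengths of the various vectors in some of these equations suggests that
square roots of products, such as $a^2$, will need to be taken. However the final result is square root free
if the original vectors $\mathbf{a}$ and $\mathbf{b}$ are of equal length. The generalisation of this result to three or more
dimensions is remarkably simpler than the rotation formulas found in standard texts.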
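
The square-root-free character of eq(63) for vectors of equal length can be seen in a short rational-arithmetic sketch. The assumptions are stated plainly: the sample vectors a = (3, 4) and b = (5, 0) both have length 5, so c may be taken as a + b (a rescaling of eq(60) that cancels in the quotient), and the helper names `gp`, `vec` and `rotate_multivector` are hypothetical. The case a = -b, where c vanishes, is not handled.

```python
from fractions import Fraction

def gp(A, B, eta=-1):
    """Clifford product of (s, x, y, k) tuples with x*x = y*y = eta and k = x*y."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,
        a0*b3 + a3*b0 + a1*b2 - a2*b1,
    )

def vec(x, y):
    """A pure vector x*x_hat + y*y_hat with rational components."""
    return (Fraction(0), Fraction(x), Fraction(y), Fraction(0))

def rotate_multivector(A, a, b, eta=-1):
    """Apply c a A a c / (a^2 c^2) with c = a + b, assuming |a| = |b| and a != -b."""
    c = tuple(p + q for p, q in zip(a, b))
    a_sq = gp(a, a, eta)[0]                 # eta * |a|^2
    c_sq = gp(c, c, eta)[0]                 # eta * |c|^2
    ca, ac = gp(c, a, eta), gp(a, c, eta)
    norm = a_sq * c_sq                      # equals |a|^2 |c|^2 because eta^2 = 1
    return tuple(x / norm for x in gp(gp(ca, A, eta), ac, eta))

a, b = vec(3, 4), vec(5, 0)
A = (Fraction(2), Fraction(0), Fraction(5), Fraction(7))   # 2 + 5 y_hat + 7 k_hat
print(rotate_multivector(A, a, b))
```

Only the squared lengths of a and c enter, so the whole calculation stays within the rationals; the scalar and bi-vector parts of A pass through unchanged while its vector part is rotated.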

3.7 Dot and wedge products of vectors 

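As in freshman algebra, define the dot product, $\mathbf{a}\cdot\mathbf{b}$, as the symmetric part of the product, $\tfrac{1}{2}(\mathbf{a}\mathbf{b} + \mathbf{b}\mathbf{a})$.
The cross product is not easily defined in 2D, so instead we define a wedge product as the anti-symmetric
part of the product, $\tfrac{1}{2}(\mathbf{a}\mathbf{b} - \mathbf{b}\mathbf{a})$. Thus

$$\mathbf{a}\mathbf{b} = \mathbf{a}\cdot\mathbf{b} + \mathbf{a}\wedge\mathbf{b} \tag{64}$$

where

$$\mathbf{a}\cdot\mathbf{b} = \tfrac{1}{2}(\mathbf{a}\mathbf{b} + \mathbf{b}\mathbf{a}) \tag{65}$$

and

$$\mathbf{a}\wedge\mathbf{b} = \tfrac{1}{2}(\mathbf{a}\mathbf{b} - \mathbf{b}\mathbf{a}) \tag{66}$$

Returning to the expression eq(28) allows these to be written in coordinates

$$\mathbf{a}\cdot\mathbf{b} = \eta(a_x b_x + a_y b_y) \tag{67}$$

and

$$\mathbf{a}\wedge\mathbf{b} = (a_x b_y - a_y b_x)\hat{\mathbf{k}} \tag{68}$$

Note that $\mathbf{a}\cdot\mathbf{b}$ is a scalar (we sometimes say it is a zero-vector) while $\mathbf{a}\wedge\mathbf{b}$ is a pure bi-vector. Some
authors use the term 2-vectors for our term bi-vectors, but this can cause confusion with the name for
a vector in 2D space. Note that the square of any vector is a scalar, $\mathbf{a}^2 = \mathbf{a}\cdot\mathbf{a} = \eta a^2$; it has no pure
bi-vector part.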
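
The split of the product into its symmetric and anti-symmetric parts, eq(64) to eq(68), can be sketched as follows. This is an illustration under assumptions: multivectors are coefficient tuples (scalar, x, y, k), the sample vectors are arbitrary, and the helper names `gp`, `vec`, `dot` and `wedge` are hypothetical.

```python
from fractions import Fraction

def gp(A, B, eta=-1):
    """Clifford product of (s, x, y, k) tuples with x*x = y*y = eta and k = x*y."""
    a0, a1, a2, a3 = A
    b0, b1, b2, b3 = B
    return (
        a0*b0 + eta*(a1*b1 + a2*b2) - eta*eta*a3*b3,
        a0*b1 + a1*b0 - eta*a2*b3 + eta*a3*b2,
        a0*b2 + a2*b0 + eta*a1*b3 - eta*a3*b1,
        a0*b3 + a3*b0 + a1*b2 - a2*b1,
    )

def vec(x, y):
    """A pure vector x*x_hat + y*y_hat with rational components."""
    return (Fraction(0), Fraction(x), Fraction(y), Fraction(0))

def dot(a, b, eta=-1):
    """Symmetric part (ab + ba)/2 of the product, eq(65)."""
    return tuple((p + q) / 2 for p, q in zip(gp(a, b, eta), gp(b, a, eta)))

def wedge(a, b, eta=-1):
    """Anti-symmetric part (ab - ba)/2 of the product, eq(66)."""
    return tuple((p - q) / 2 for p, q in zip(gp(a, b, eta), gp(b, a, eta)))

a, b = vec(3, 4), vec(5, 0)     # sample vectors
print(dot(a, b))                # pure scalar: eta * (3*5 + 4*0)
print(wedge(a, b))              # pure bi-vector: (3*0 - 4*5) k_hat
```

Adding the two results coefficient by coefficient recovers the full product ab, as in eq(64).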

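If the angle from $\mathbf{a}$ to $\mathbf{b}$ is $\theta_b - \theta_a$, then we may define $\alpha = \cos(\theta_b - \theta_a)$ and obtain

$$\begin{aligned}
\alpha &= \cos(\theta_b - \theta_a) \\
&= \mathbf{a}\cdot\mathbf{b}/ab \\
&= (a_x b_x + a_y b_y)/(\eta ab)
\end{aligned} \tag{69}$$

Likewise by defining

$$\begin{aligned}
\beta &= \sin(\theta_b - \theta_a) \\
\beta\hat{\mathbf{k}} &= \mathbf{a}\wedge\mathbf{b}/(-ab) \\
&= (a_x b_y - a_y b_x)/(-ab)\,\hat{\mathbf{k}}
\end{aligned} \tag{70}$$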
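
This notation allows us to rewrite eq(56) as

$$\begin{aligned}
\mathrm{ROT}(\mathbf{a} \to \mathbf{b})(\mathbf{r}) &= \mathbf{r}' \\
&= \mathbf{r}\exp(-\hat{\mathbf{k}}(\theta_b - \theta_a)) \\
&= r'_x\hat{\mathbf{x}} + r'_y\hat{\mathbf{y}} \\
&= \mathbf{r}(\alpha + \beta\hat{\mathbf{k}}) \\
&= (\alpha - \beta\hat{\mathbf{k}})\mathbf{r} \\
&= (r_x\alpha - r_y\beta)\hat{\mathbf{x}} + (r_x\beta + r_y\alpha)\hat{\mathbf{y}}
\end{aligned} \tag{71}$$

or in matrix form, for coordinates written as rows with matrices on the right

$$(r'_x, r'_y) = (r_x, r_y)\begin{pmatrix} \alpha & \beta \\ -\beta & \alpha \end{pmatrix} \tag{72}$$

while for coordinates written as columns and with matrices on the left, we have

$$\begin{pmatrix} r'_x \\ r'_y \end{pmatrix} = \begin{pmatrix} \alpha & -\beta \\ \beta & \alpha \end{pmatrix}\begin{pmatrix} r_x \\ r_y \end{pmatrix} \tag{73}$$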



3.8 Concluding remarks - What have we achieved? 

Section [2] used homogeneity of 2D space to get the well known properties of vectors in 2D. The only
non-standard claims were (i) that because all physical measurements are finite with upper and lower limits
$\ell_{\max}$ and $\ell_{\min}$, not all the mathematical operations of the vector space have physical counterparts, and
(ii) that the physics of rigid bodies suggests that Q, the field of rational numbers, is the appropriate
field.

In this section we have deduced some old but less familiar consequences of the isotropy of space. 
Our study of movement, with one point fixed, of rigid bodies in our 2D toy world of sheets of paper 
on a desktop, led us to many of the properties of rotations and to an algebra to describe them. 

The concept of a right angle rotation was introduced as a special case of rotations through a
rational fraction, $r$, of $2\pi$ (or 360°). The angles of $0, \pm\pi, \pm 2\pi, \ldots, n\pi$ are special in that lines AB
are rotated into themselves or to their negatives. The right angle rotations $\pm\tfrac{1}{2}\pi$ or $\pm\tfrac{3}{2}\pi$, etc., rotate
orthogonal pairs of lines AA' and BB' into corresponding pairs BB' and A'A, etc. This approach to
defining orthogonality from isotropy considerations is not common; usually a metric is defined on the
corresponding metric space first.

In our case we introduce the Euclidean metric after defining a product relationship on the lines of
the physical space, and a corresponding product on the vectors of the previous section. By requiring the
product to be associative, and to incorporate Pythagoras' identity, we obtain a choice of two algebras,
each with four basis elements: the scalar, 1, the unit vectors $\hat{\mathbf{x}}$ and $\hat{\mathbf{y}}$, and the less familiar $\hat{\mathbf{k}}$ describing
the plane. One algebra corresponds to the Euclidean metric (+, +), and the other to the anti-Euclidean
metric (-, -). The next section [4] proves that it is the latter metric that describes the geometry of our
world.

The product introduced in this section was introduced by Clifford [15] a long time ago in relation
to the symmetries of Maxwell's equations, but has rarely been used by physicists. There are however
some physicists and computer scientists who have used Clifford algebras, see Hestenes [16, 17, 18, 19],
Gull [20], Doran and Lasenby [21] and the conference proceedings of the Clifford Society [22, 23]. Our
introduction of the Clifford product of two vectors a and b is by a more geometric route, but one that
is rather less common [13]. We introduced the bi-vector ab as representing, in a passive sense, the
equivalence class of two sets of lines that lie in the 2D toy world of the plane that is our desktop. These
two sets of lines subtend a fixed angle, $\theta_{ab}$, between each other. The "angle" in the algebra is thus
an abstraction of the angle between any of the lines in the equivalence class of the vectors a and b.
Furthermore, the product of the lengths of the lines (or vectors), ab = ||a|| ||b||, is fixed. While all lines
AA' belonging to the vector equivalence class, a = [AA'], have the same fixed length and direction,
this is not true for the bi-vector equivalence class. Instead ab = [(AA', BB')] can be seen to represent
the class of all parallelograms in the desktop which have the same area and subtend the same angle.
All parallelograms that are rotations and translations of the first parallelogram (AB, CD) are in the
equivalence class ab = [(AB, CD)].

Any bi-vector ab can be written as a multi-vector with a scalar part, $\eta\, ab \cos\theta_{ab}$, and a pure
bi-vector part, $ab \sin\theta_{ab}\, \mathbf{k}$. Although $\mathbf{k}^2 = -1$, k is not the complex number i. The usual
formulation of rotations in a plane uses complex numbers, giving the four basis elements (x, y, ix, iy)
to use. In the Clifford algebra formulation derived here, the product properties of the basis elements
(1, x, y, k) are very different to the complex number properties.

In parallel to the above passive interpretations of bi-vectors ab, the bi-vectors are, in an active sense,
operators that rotate all elements of the Clifford algebra, Cℓ(0, 2). Scalars and pure bi-vector elements
are unchanged under rotations, while for vectors rotation has a very simple formula:
ROT(a → b)(r) = rab/a². The expression for the rotation of the general element A of the algebra is not
much more complicated, and is given by eq(63). In this active sense bi-vectors correspond to a class of
line pairs, (AB, AC), that rotate the points, lines, areas and indeed entire rigid bodies, relative to each
other, about the point A, being the point in common of the lines AB and AC. Any of the vector and
vector-product results of this section can therefore be written as purely geometric expressions acting on
the points, lines and areas of rigid bodies by selecting representative lines and line products for the
vectors and vector products.



4 Motion in 3D and Parity gives Cℓ(0, 3)

The generalization to three spatial dimensions of the results of our 2D toy world of the previous
two sections is straightforward. This is particularly true for extending the homogeneity considerations
of section 2. The key result of that section is that a 2D vector space over the field of rational numbers,
Q, describes the homogeneity of the desktop world. The key underlying concept is the invariance of the
size and shape of rigid bodies under movement in straight lines. Subsection 4.1 will extend the ideas to
a three dimensional vector space by considering translational motion of rigid 3D objects relative to each
other. The extension to 3D of the rotational ideas of section 3 follows in a similar manner in subsection
4.2. The algebra has the additional basis vector arising from the translational motion, but there are
two additional basis bi-vectors associated with rotations in each of two extra basis planes, and also
a new object, a tri-vector that represents volumes. The algebra is thus eight dimensional. The three
basis bi-vectors of rotation do not commute among themselves and give rise to the quaternion algebra.



Subsection 4.4 shows that in the case of the η = −1 metric choice, there are four sets of basis
elements in the algebra that behave as quaternionic sets and maintain a cyclic relationship. Since
handedness is preserved for rigid bodies under the physically realized invariances of space, homogeneity
and isotropy, we conclude that Cℓ(0, 3) describes space. Further, we conclude that Cℓ(3, 0) does not.

Many of the algebraic differences between Cℓ(0, 3) and Cℓ(3, 0) are highlighted in section 6 where
we seek matrix representations of them over the reals, R, or its subfield, the rational numbers Q.

We end this section with a few words about transformations between reference frames and by
showing that the Clifford algebra is a powerful tool for finding formulas for the rotation between
different orientations of rigid bodies. The expressions obtained, unlike Euler angle formulations, do not
use complex numbers.



4.1 Translations in 3D space 

In section 2 we considered the 2D toy world of rigid objects consisting of a desktop and several sheets
of transparent paper on it. In such a world we could move the objects relative to each other by
translational motion, sliding the paper around in 'straight line motion', to use the words of Newton's
First Law. With transparent paper, any points on one object can be marked on the other objects, and
after sliding, the distances between points can be compared directly. In this toy world we were led
via the concepts of relative lengths of parallel lines, and via linear independence, to the mathematical
concept of the basis vectors of a vector algebra.

In our 2D toy world, we could bring any two parallel lines to coincidence and compare lengths.
Extending this process to the 3D world of an office raises an immediate problem. A rigid object, such as
a book, cannot be brought into coincidence with another rigid object. In the 2D world it is possible; in
fact we have two ways of doing it. The first is that several sheets of paper can lie on top of each other,
as in our toy world we consider only the horizontal position, not the height above the desktop. The
parameter 'height' can be used to distinguish objects with the same position on the desktop. The second
way is by using the parameter 'time' to describe the different positions of the points (and lines and
parallelograms) of a single sheet of paper on the desktop.

Consider now the example of several books in my office. We can translate the books parallel to the 
horizontal surfaces (e.g. the desktop, floor or ceiling), parallel to the north-south walls, and parallel to 
the east-west walls, or any linear combination thereof. What we cannot do is place two books in the 
same position in the room. We do not have a parameter equivalent to 'height', only a time parameter.
We will study the time parameter in section 5.

Another issue is that we cannot compare the points, lines and surfaces inside one book with the 
corresponding points and lines of a second book. We need to restrict ourselves to comparisons of only 
some of the points, lines and surfaces on the surfaces of the books. To look inside we either need to 
take the rigid object apart, or use some form of remote measurement or remote sensing. 

However for many position measurements, the process for 3D rigid objects is little different from 
the process with 2D objects. Distances between points on the surfaces of rigid bodies can be measured 
by direct comparison with points on the surface of another rigid 3D body (using a measuring stick). 
As in the 2D toy world, all such measurements will be limited by the upper limit $\ell_{max}$, and lower limit
$\ell_{min}$, associated with the relevant scales for measurements with the apparatus.

Bearing in mind these restrictions however, it is clear that any line AB can be written as a linear 
combination of three non-parallel lines, OX, OY, OZ. Correspondingly, we need three basis vectors to 
describe the directions of the passive entities, the lines, and the active entities, the translations. Jumping 
ahead now to the conclusions of the next subsection, we can choose an origin O, and choose these basis 
lines OX, OY, OZ to be orthogonal so that the corresponding basis vectors are an orthonormal set x, 
y, z. We may write the vector r in this basis as 

$\mathbf{r} = r_x \mathbf{x} + r_y \mathbf{y} + r_z \mathbf{z}$ (74)

4.2 Rotations in 3D 

With the same provisos as in the subsection above regarding the somewhat indirect measurement 
process needed for the inside points of a 3D rigid body, the extension to rotational motion of a rigid 
body from the 2D toy world of section [3] to 3D follows simply. 

Consider rotating a book about the corner at the near, bottom, left when it is initially resting on
the desktop, as shown in figure 8. Rotations can take place about the x, y, and z axes.

Fig. 8 A book resting on the desktop, and rotations by π/2 about the x, y, and z axes or equivalently the i,
j, and k planes respectively. (a) A book lying on our desktop. (b) The book after a rotation of π/2 in the
i-plane or about the x-axis. (c) The book after a rotation of π/2 in the j-plane or about the y-axis.
(d) The book after a rotation of π/2 in the k-plane or about the z-axis.

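We showed in section 3 that isotropy and Pythagoras, when applied to considerations of rotations
in the plane of the desktop (the xy-plane), led to the requirement that $\mathbf{x}^2 = \mathbf{y}^2 = \eta$ and also that
$\mathbf{x}\mathbf{y} = -\mathbf{y}\mathbf{x} = \mathbf{k}$. Applying the argument to the two vertical planes (yz and zx) leads to

$\mathbf{x}^2 = \mathbf{y}^2 = \mathbf{z}^2 = \eta$ (75)
$\mathbf{x}\mathbf{y} = -\mathbf{y}\mathbf{x} = \mathbf{k}$ (76)
$\mathbf{y}\mathbf{z} = -\mathbf{z}\mathbf{y} = \mathbf{i}$ (77)
$\mathbf{z}\mathbf{x} = -\mathbf{x}\mathbf{z} = \mathbf{j}$ (78)
$\mathbf{x}\mathbf{y}\mathbf{z} = \mathbf{v}$ (79)
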

where we define i, j and v as in eqs(77 to 79). The rotations of figure 8 can thus be equivalently labeled
as being in the i, j, and k planes respectively, as shown in the figure. The definitions are chosen to retain
the cyclic order of the basis vectors (x, y, z) when defining the basis bi-vectors (i, j, k), and the basis
tri-vector v. The eight elements {1, x, y, z, i, j, k, v} form the basis of the Clifford algebra Cℓ(0, 3) when
η = −1 and the Clifford algebra Cℓ(3, 0) when η = +1.



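The above results for (x, y, z) and definitions for (i, j, k) lead to the properties

$\mathbf{i}\mathbf{j} = -\mathbf{j}\mathbf{i} = -\eta\,\mathbf{k}$ (80)
$\mathbf{j}\mathbf{k} = -\mathbf{k}\mathbf{j} = -\eta\,\mathbf{i}$ (81)
$\mathbf{k}\mathbf{i} = -\mathbf{i}\mathbf{k} = -\eta\,\mathbf{j}$ (82)
$\mathbf{i}\mathbf{j}\mathbf{k} = \eta$ (83)

Thus if we choose the anti-Euclidean metric, η = −1, we have

$\mathbf{i}^2 = \mathbf{j}^2 = \mathbf{k}^2 = -1$ (84)
$\mathbf{i}\mathbf{j} = -\mathbf{j}\mathbf{i} = \mathbf{k}$ (85)
$\mathbf{j}\mathbf{k} = -\mathbf{k}\mathbf{j} = \mathbf{i}$ (86)
$\mathbf{k}\mathbf{i} = -\mathbf{i}\mathbf{k} = \mathbf{j}$ (87)
$\mathbf{i}\mathbf{j}\mathbf{k} = -1$ (88)
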

These equations are the relations that characterize Hamilton's quaternions [23]

$\mathbf{i}^2 = \mathbf{j}^2 = \mathbf{k}^2 = \mathbf{i}\mathbf{j}\mathbf{k} = -1$ (89)

as the relations of eqs(84 to 88) can be readily derived from eq(89).

Observe that there are four sets of basis elements in Cℓ(0, 3) that match the quaternion relations,
the ordered set of bi-vectors (i, j, k), and the ordered sets (x, y, k), (y, z, i) and (z, x, j). Six of the
eight basis elements square to −1, namely x, y, z, i, j, k, while 1 and v square to +1.

On the other hand for Cℓ(3, 0) only one set of the basis elements matches the quaternion relations,
namely the set of bi-vectors (−i, −j, −k), where the minus sign is required to retain the cyclic structure.
The bi-vectors i, j, k and the tri-vector v are the four of the eight that square to −1, while the other
four, 1, x, y, z, square to +1.
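
The counting in the two paragraphs above can be checked mechanically. The following is a minimal sketch, not taken from the paper: it assumes basis blades are stored as bitmasks (x = 1, y = 2, z = 4) with an explicit sign, and the helper names `blade_product`, `mul`, `basis` and `is_quaternion_triple` are hypothetical. The only inputs are the relations of eqs(75 to 79).

```python
def blade_product(a, b, eta):
    """Multiply two basis blades given as bitmasks (x=1, y=2, z=4).

    Returns (sign, blade), where every basis vector squares to eta (+1 or -1).
    """
    sign = 1
    shifted = a >> 1
    while shifted:
        # Count pairs (vector in a, vector in b) that are out of canonical order;
        # each such pair costs one swap and therefore one sign flip.
        if bin(shifted & b).count("1") % 2:
            sign = -sign
        shifted >>= 1
    # A basis vector present in both blades contributes one factor of eta.
    if bin(a & b).count("1") % 2:
        sign *= eta
    return sign, a ^ b


def mul(p, q, eta):
    """Product of two signed blades, each stored as (sign, bitmask)."""
    sign, blade = blade_product(p[1], q[1], eta)
    return (p[0] * q[0] * sign, blade)


def neg(p):
    return (-p[0], p[1])


def basis(eta):
    """The eight basis elements, with k, i, j, v defined as in eqs (76) to (79)."""
    one, x, y, z = (1, 0), (1, 1), (1, 2), (1, 4)
    k, i, j = mul(x, y, eta), mul(y, z, eta), mul(z, x, eta)
    v = mul(k, z, eta)
    return {"1": one, "x": x, "y": y, "z": z, "i": i, "j": j, "k": k, "v": v}


def is_quaternion_triple(a, b, c, eta):
    """True if a^2 = b^2 = c^2 = abc = -1 and ab = c, bc = a, ca = b."""
    minus_one = (-1, 0)
    squares = all(mul(e, e, eta) == minus_one for e in (a, b, c))
    cyclic = mul(a, b, eta) == c and mul(b, c, eta) == a and mul(c, a, eta) == b
    return squares and cyclic and mul(mul(a, b, eta), c, eta) == minus_one


def squares_to_minus_one(eta):
    """Names of the basis elements whose square is -1."""
    b = basis(eta)
    return sorted(name for name, e in b.items() if mul(e, e, eta) == (-1, 0))


if __name__ == "__main__":
    for eta in (-1, +1):
        print(eta, squares_to_minus_one(eta))
```

With η = −1 the four ordered sets (i, j, k), (x, y, k), (y, z, i) and (z, x, j) all pass `is_quaternion_triple`; with η = +1 only the negated bi-vector set does.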



4.3 Rotations do not commute 

It is a fact that rotations in different planes are non-commutative. One of the reasons that the
appropriate algebraic structure to describe rotations is an algebra, is that products in an algebra are not
necessarily commutative. In the example below we choose rotations of π/2 in the basis planes, as
these are easiest to draw.

Consider the example of the book of fig 9(a), initially lying on the desktop ready to be opened and
read. First rotate the book by π/2 in the xy-plane, that is the horizontal plane, k.

Fig. 9 A book lying on our desktop, and rotated x → y then y → z. (a) A book lying on our desktop
ready to be read. (b) The same book after a rotation of π/2 in the xy-plane. (c) The same book after a
second rotation of π/2 in the yz-plane.


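The line product (OA, OB) rotates line OA into line OA' parallel to OB, and OB into OB' parallel
to −OA. Line OC is not moved, OC' = OC, see fig 9(b). The Clifford algebra operation that takes
the arbitrary element of the book

$A^{(0)}_{book} = a + b\mathbf{x} + c\mathbf{y} + d\mathbf{z} + e\mathbf{i} + f\mathbf{j} + g\mathbf{k} + h\mathbf{v}$ (90)

through the angle π/2 in the xy-plane, to its image $A^{(1)}_{book}$, is

$\mathrm{ROT}(\mathbf{x} \to \mathbf{y})(A^{(0)}_{book}) = \mathrm{ROT}(OA \to OB)(A^{(0)}_{book})$
$\quad = \mathrm{ROT}(\text{by } \pi/2 \text{ in plane } \mathbf{k})(A^{(0)}_{book})$
$\quad = A^{(1)}_{book}$
$\quad = (\mathbf{x} + \mathbf{y})\mathbf{x}\, A^{(0)}_{book}\, \mathbf{x}(\mathbf{x} + \mathbf{y})/2$
$\quad = e^{\mathbf{k}\pi/4} A^{(0)}_{book} e^{-\mathbf{k}\pi/4}$ (91)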
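
The equality of the last two lines of eq(91) can be checked directly from the basis relations. The following short derivation is a sketch that assumes only $\mathbf{x}^2 = \mathbf{y}^2 = \eta$, $\mathbf{x}\mathbf{y} = -\mathbf{y}\mathbf{x} = \mathbf{k}$, $\mathbf{k}^2 = -1$, and the usual series definition of the exponential, so that $e^{\mathbf{k}\theta} = \cos\theta + \mathbf{k}\sin\theta$:

```latex
\begin{align*}
(\mathbf{x}+\mathbf{y})\mathbf{x} &= \mathbf{x}^2 + \mathbf{y}\mathbf{x} = \eta - \mathbf{k}, \\
\mathbf{x}(\mathbf{x}+\mathbf{y}) &= \mathbf{x}^2 + \mathbf{x}\mathbf{y} = \eta + \mathbf{k}, \\
\tfrac{1}{2}(\mathbf{x}+\mathbf{y})\mathbf{x}\,A\,\mathbf{x}(\mathbf{x}+\mathbf{y})
  &= \tfrac{1}{2}(\eta - \mathbf{k})\,A\,(\eta + \mathbf{k}), \\
\eta = -1:\quad \tfrac{1}{2}(-1 - \mathbf{k})\,A\,(-1 + \mathbf{k})
  &= \frac{1 + \mathbf{k}}{\sqrt{2}}\,A\,\frac{1 - \mathbf{k}}{\sqrt{2}}
   = e^{\mathbf{k}\pi/4}\,A\,e^{-\mathbf{k}\pi/4}.
\end{align*}
```

For η = +1 the same steps give $e^{-\mathbf{k}\pi/4} A\, e^{\mathbf{k}\pi/4}$ instead, so the exponential form as written is tied to the anti-Euclidean choice.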

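This generalisation of the results of section 3 works for all parts of $A^{(0)}_{book}$. For example, the proof for
multi-vectors follows from linearity and by inserting the identity operator in the appropriate places,
e.g.

$\mathrm{ROT}(\pi/2, \mathbf{k})(\mathbf{a}\mathbf{b}) = e^{\mathbf{k}\pi/4}(\mathbf{a}\mathbf{b})e^{-\mathbf{k}\pi/4}$
$\quad = (e^{\mathbf{k}\pi/4}\mathbf{a}\,e^{-\mathbf{k}\pi/4})(e^{\mathbf{k}\pi/4}\mathbf{b}\,e^{-\mathbf{k}\pi/4})$
$\quad = \big(\mathrm{ROT}(\pi/2, \mathbf{k})(\mathbf{a})\big)\big(\mathrm{ROT}(\pi/2, \mathbf{k})(\mathbf{b})\big)$ (92)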



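Rotate the book now by π/2 in the yz-plane, as shown in fig 9(c). The line product (OA, OC) rotates
the line OB' into the line OB'' parallel to −OC, and OC into OC'' parallel to −OA. The line OA' is
not moved, OA'' = OA', which is parallel to OB, see fig 9(c).

$\mathrm{ROT}(\mathbf{y} \to \mathbf{z})(A^{(1)}_{book}) = \mathrm{ROT}(OB \to OC)(A^{(1)}_{book})$
$\quad = \mathrm{ROT}(\text{by } \pi/2 \text{ in plane } \mathbf{i})(A^{(1)}_{book})$
$\quad = A^{(2)}_{book}$
$\quad = e^{\mathbf{i}\pi/4} A^{(1)}_{book} e^{-\mathbf{i}\pi/4}$
$\quad = e^{\mathbf{i}\pi/4} e^{\mathbf{k}\pi/4} A^{(0)}_{book} e^{-\mathbf{k}\pi/4} e^{-\mathbf{i}\pi/4}$ (93)
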
The book is now standing on its lower edge with its front cover closest to us. The three lines OA, OB, OC
have moved to lines parallel to OB, −OC, −OA respectively.

Now do these two rotations in the reverse order. First rotate by π/2 in the yz-plane, as shown in
fig 10(b). The line product (OA, OC) rotates the line OA into line OA' parallel to OC, and OC into
OC' parallel to −OA. Line OB is not moved, OB' = OB.

Fig. 10 A book lying on our desktop, and rotated y → z then x → y. (a) A book lying on our desktop
ready to be read. (b) The same book after a rotation of π/2 in the yz-plane. (c) The same book after a
second rotation of π/2 in the xy-plane.



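$\mathrm{ROT}(\mathbf{y} \to \mathbf{z})(A^{(0)}_{book}) = \mathrm{ROT}(OB \to OC)(A^{(0)}_{book})$
$\quad = \mathrm{ROT}(\text{by } \pi/2 \text{ in plane } \mathbf{i})(A^{(0)}_{book})$
$\quad = A^{(3)}_{book}$
$\quad = e^{\mathbf{i}\pi/4} A^{(0)}_{book} e^{-\mathbf{i}\pi/4}$ (94)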



Next rotate the book by π/2 in the xy-plane, as shown in fig 10(c). The line OB' is rotated into line
OB'' parallel to −OA, and OC into OC'' parallel to −OB by the line product (OA, OB). Line OA'
is not moved, OA'' = OA'.



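$\mathrm{ROT}(\text{by } \pi/2 \text{ in plane } \mathbf{k})(A^{(3)}_{book}) = A^{(4)}_{book}$
$\quad = e^{\mathbf{k}\pi/4} A^{(3)}_{book} e^{-\mathbf{k}\pi/4}$
$\quad = e^{\mathbf{k}\pi/4} e^{\mathbf{i}\pi/4} A^{(0)}_{book} e^{-\mathbf{i}\pi/4} e^{-\mathbf{k}\pi/4}$ (95)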



The book is now standing on its spine, front cover facing right. The three lines OA, OB, OC have
moved to lines parallel to OC, −OA, −OB (the first follows from OA'' = OA', which is parallel to OC),
and the two results differ by a rotation by 2π/3 about the line x − y + z.
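
The non-commutativity, and the 2π/3 angle between the two outcomes, can be checked numerically. The sketch below is not the Clifford algebra calculation of eqs(93, 95): it assumes the two quarter turns are represented by ordinary 3×3 rotation matrices in a fixed frame (x → y and y → z), and the names `ROT_XY`, `ROT_YZ` and `rotation_angle` are hypothetical. It checks only the angle of the relative rotation, not its axis.

```python
import math

# Quarter turns as integer matrices acting on column vectors (x, y, z).
ROT_XY = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]  # x -> y, y -> -x, z fixed
ROT_YZ = [[1, 0, 0], [0, 0, -1], [0, 1, 0]]  # y -> z, z -> -y, x fixed


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[r][n] * b[n][c] for n in range(3)) for c in range(3)] for r in range(3)]


def transpose(a):
    """Transpose, which is the inverse for a rotation matrix."""
    return [[a[c][r] for c in range(3)] for r in range(3)]


def rotation_angle(r):
    """Angle of a proper rotation matrix, from trace(R) = 1 + 2 cos(angle)."""
    trace = r[0][0] + r[1][1] + r[2][2]
    return math.acos((trace - 1) / 2)


xy_then_yz = matmul(ROT_YZ, ROT_XY)  # the rightmost factor acts first
yz_then_xy = matmul(ROT_XY, ROT_YZ)

# Rotation that carries the first final orientation into the second one.
relative = matmul(yz_then_xy, transpose(xy_then_yz))

if __name__ == "__main__":
    print(xy_then_yz != yz_then_xy, rotation_angle(relative))
```

Integer matrices are used so that the comparison of the two orderings is exact; only the final angle involves floating point.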

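The exponential expressions for the rotation operator can be rewritten in terms of the vector
products, for example for fig 9 we have

$\mathrm{ROT}(\pi/2, \mathbf{i})\big(\mathrm{ROT}(\pi/2, \mathbf{k})(A^{(0)}_{book})\big) = A^{(2)}_{book}$
$\quad = \mathrm{ROT}(\mathbf{y} \to \mathbf{z})\big(\mathrm{ROT}(\mathbf{x} \to \mathbf{y})(A^{(0)}_{book})\big)$
$\quad = (\mathbf{y} + \mathbf{z})\mathbf{y}(\mathbf{x} + \mathbf{y})\mathbf{x}\, A^{(0)}_{book}\, \mathbf{x}(\mathbf{x} + \mathbf{y})\mathbf{y}(\mathbf{y} + \mathbf{z})/4$ (96)

which equals eq(93) when η = −1 because cos(π/4) = sin(π/4) = 1/√2.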

4.4 Handedness conservation and choice of metric

It is well known that the metric chosen for special relativity is subject to the choice of (+, −, −, −) or
(−, +, +, +), a choice known by some USA researchers as the East Coast versus West Coast choice:
merely a matter of taste! Certainly the choice seems governed more by the one used by nearby
colleagues than by physical principles. Indeed, for most situations the choice is irrelevant. One reason
for this is that most physical calculations are done in the complex number field, C, where the effect of
the sign of the metric disappears. However complexifying the algebra changes the topology, and it is in
the topology that we should look for the differences. Our concern with the effects of topology is one of
the reasons not to assume the complex number field in our development.

One of us has argued earlier [13] that to respect the cyclic properties of the triads (x, y, z) and (i,
j, k) in Cℓ(0, 3) we must have the metric (−, −, −) to preserve the handedness in the observed world.
The first part of the argument regarding these special cyclic properties of the basis vectors of Cℓ(0, 3)
was presented above, at the close of subsection 4.2. To be explicit, physical operations in our physical
3D space retain their handedness, that is, they retain the cyclic structure in the ordering of axes and
planes in rigid bodies. It is only in the Clifford algebra Cℓ(0, 3) that the sets (x, y, z) and (i, j, k)
have a cyclic structure that is respected by the operations of the algebra. Some operations of Cℓ(3, 0)
take some cyclic orderings to their reverses.

Another explicit example of when the sign of the spatial metric distinguishes the two algebras
is the dual operation defined by left (or right) multiplication by the unit tri-vector v, where v was
defined in eq(79). (In the usual mathematical study of algebras there is some interest in operations
that, when applied twice, are equivalent to the identity operation. Operators familiar to students in
university courses in introductory mathematics include the inverse operation $A^{-1}$, matrix transposition
and complex conjugation.)

Use A* to denote the dual of an arbitrary element A formed by right multiplication by v, A* = Av.
We call this dual the v-dual, or pseudo-scalar-dual, or spatial dual. Using the linearity properties, we
need only study the action of * on the basis elements, and obtain

$\{1^*, \mathbf{x}^*, \mathbf{y}^*, \mathbf{z}^*, \mathbf{i}^*, \mathbf{j}^*, \mathbf{k}^*, \mathbf{v}^*\} = \{\mathbf{v}, \eta\mathbf{i}, \eta\mathbf{j}, \eta\mathbf{k}, -\mathbf{x}, -\mathbf{y}, -\mathbf{z}, -\eta\}$ (97)

The scalar and tri-vector are dual to each other, and each vector is dual to the bi-vector (or plane) to
which it is orthogonal. It is only for η = −1 that we have (A*)* = A for this simple definition of a
dual.
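
The sign behaviour of the double dual follows from the square of the tri-vector. A short derivation, as a sketch that assumes only eqs(75 to 79) and the associativity of the product:

```latex
\begin{align*}
\mathbf{v}^2 &= (\mathbf{x}\mathbf{y}\mathbf{z})(\mathbf{x}\mathbf{y}\mathbf{z})
   = \mathbf{x}\,(\mathbf{y}\mathbf{z})\,\mathbf{x}\,(\mathbf{y}\mathbf{z})
   = \mathbf{x}^2\,(\mathbf{y}\mathbf{z})^2
   && \text{($\mathbf{x}$ anticommutes with $\mathbf{y}$ and $\mathbf{z}$, so it commutes with $\mathbf{y}\mathbf{z}$)} \\
 &= \eta\,(-\mathbf{y}^2\mathbf{z}^2) = -\eta^3 = -\eta, \\
(A^*)^* &= (A\mathbf{v})\mathbf{v} = A\,\mathbf{v}^2 = -\eta\,A .
\end{align*}
```

Hence (A*)* = A when η = −1, while (A*)* = −A when η = +1.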

Section 6 takes the study of the differences between Cℓ(0, 3) and Cℓ(3, 0) a step further by studying
matrix representations of the algebras, and their corresponding groups.

4.5 The conventional vector cross product 

The familiar cross product a × b of the Heaviside-Gibbs algebra is, up to a sign, the spatial dual of the
anti-symmetric part of ab, $\mathbf{a} \wedge \mathbf{b} = \tfrac{1}{2}(\mathbf{a}\mathbf{b} - \mathbf{b}\mathbf{a})$. With the dual A* = Av of eq(97),

$\mathbf{a} \times \mathbf{b} = -(\mathbf{a} \wedge \mathbf{b})^*$
$\quad = -(\mathbf{a} \wedge \mathbf{b})\mathbf{v}$ (98)

Writing a × b out in components gives

$\mathbf{a} \times \mathbf{b} = (a_y b_z - a_z b_y)\mathbf{x} + (a_z b_x - a_x b_z)\mathbf{y} + (a_x b_y - a_y b_x)\mathbf{z}$ (99)

where the z-component of the product is obtained from $a_x$ times $b_y$ and so on, cyclically. For the wedge
product, we get similarly

$\mathbf{a} \wedge \mathbf{b} = (a_y b_z - a_z b_y)\mathbf{i} + (a_z b_x - a_x b_z)\mathbf{j} + (a_x b_y - a_y b_x)\mathbf{k}$ (100)

so that for example the i component of the wedge product is the x component of the cross product

$(\mathbf{a} \wedge \mathbf{b})|_{\mathbf{i}} = (\mathbf{a} \times \mathbf{b})|_{\mathbf{x}} = (a_y b_z - a_z b_y)$ (101)

The key result from the above is that a ∧ b is a pure bi-vector that represents an equivalence class
of squares of the appropriate area in the plane in which a and b lie, whereas a × b is a vector that
represents an equivalence class of lines of the appropriate length that are normal to that plane.
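
The component relations of eqs(99 to 101) can be checked with a small geometric product. This is a minimal sketch under stated assumptions: multivectors are stored as dictionaries from blade bitmasks (x = 1, y = 2, z = 4) to coefficients, and the helper names `blade_sign`, `gp`, `wedge`, `ijk_components` and `minus_dual` are hypothetical. It confirms that the (i, j, k) components of a ∧ b equal the (x, y, z) components of a × b, and that −(a ∧ b)v reproduces a × b for either sign of η.

```python
def blade_sign(a, b, eta):
    """Sign of the product of two basis blades given as bitmasks (x=1, y=2, z=4)."""
    sign = 1
    shifted = a >> 1
    while shifted:
        # One sign flip per pair of basis vectors that must be swapped.
        if bin(shifted & b).count("1") % 2:
            sign = -sign
        shifted >>= 1
    # One factor of eta per basis vector shared by both blades.
    if bin(a & b).count("1") % 2:
        sign *= eta
    return sign


def gp(A, B, eta):
    """Geometric product of multivectors stored as {blade bitmask: coefficient}."""
    out = {}
    for a, ca in A.items():
        for b, cb in B.items():
            out[a ^ b] = out.get(a ^ b, 0) + blade_sign(a, b, eta) * ca * cb
    return out


def vector(ax, ay, az):
    return {1: ax, 2: ay, 4: az}


def wedge(a, b, eta):
    """Anti-symmetric part (ab - ba)/2 of the geometric product."""
    ab, ba = gp(a, b, eta), gp(b, a, eta)
    return {blade: (ab.get(blade, 0) - ba.get(blade, 0)) / 2 for blade in set(ab) | set(ba)}


def ijk_components(bivector):
    """Coefficients on i = yz, j = zx, k = xy (zx = -xz, hence the sign on j)."""
    return (bivector.get(6, 0), -bivector.get(5, 0), bivector.get(3, 0))


def minus_dual(bivector, eta):
    """-(B v) with v = xyz, returned as (x, y, z) components."""
    bv = gp(bivector, {7: 1}, eta)
    return (-bv.get(1, 0), -bv.get(2, 0), -bv.get(4, 0))


def cross(a, b):
    """Conventional Heaviside-Gibbs cross product of two coordinate triples."""
    (ax, ay, az), (bx, by, bz) = a, b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)
```

For example, with a = (1, 2, 3) and b = (4, 5, 6), both `ijk_components(wedge(vector(*a), vector(*b), -1))` and `cross(a, b)` give the components (−3, 6, −3).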

4.6 The arbitrary rotation axis



Subsections 4.2 and 4.3 both used rotations in the plane of the given vectors. The rotations were
expressed either in the form ROT(a → a'), which is defined as the rotation in the aa'-plane by the
angle between a and a', $\theta_{a'a}$, or in the alternative form ROT(by $\theta_{a'a}$ in the a ∧ a'-plane). In 3D, the
line a × a' is perpendicular to the a ∧ a'-plane, and thus we may write it in the various forms

$\mathrm{ROT}(\mathbf{a} \to \mathbf{a}') = \mathrm{ROT}(\text{by } \theta_{a'a} \text{ in the } \mathbf{a} \wedge \mathbf{a}'\text{-plane})$
$\quad = \mathrm{ROT}(\text{by } \theta_{a'a} \text{ about the line } \mathbf{a} \times \mathbf{a}')$
$\quad = \mathrm{ROT}(\text{by } \theta_{a'a} \text{ about the line } (\mathbf{a} \wedge \mathbf{a}')^*)$ (102)

The last expression follows from the duality between a line (or vector) and the plane (or bi-vector)
normal to it. In the above we assume our standard convention of an anti-clockwise rotation from a
to a'. The rotation ROT(a' → a) is the inverse to ROT(a → a'), but since all our rotations are
anti-clockwise, the angle $\theta_{a'a} = 2\pi - \theta_{aa'}$.

The minimum rotation angle θ is the smaller of $\theta_{aa'}$ and $\theta_{a'a}$. We have $\cos\theta = \mathbf{a}\cdot\mathbf{a}'/\mathbf{a}^2$ (we
assume here that a and a' are the same length, a = a') and choose to have $\sin\theta = \|\mathbf{a} \times \mathbf{a}'/\mathbf{a}^2\|$, so
that 0 < θ < π. However in 3D there are many rotations, not just ROT(a' → a) and ROT(a → a'),
that take a vector a into its image a'. The rotation by π about the vector a + a' that lies halfway
between a and a' also suffices, as does the rotation by the appropriate angle about any axis lying in
the (a × a', a + a')-plane. This plane is perpendicular to the vector a − a', see figure 11, and so is the
(a − a')*-plane.




Fig. 11 The plane formed from a × b and a + b is perpendicular to a − b. The plane ab is vertical in this
diagram and the rotation of a into b in this plane is a rotation through angle $\theta_{ab}$ about the horizontal axis
a × b. The rotation through π about axis a + b also rotates a into b.



Any vector that is a linear combination of the above two vectors

$\mathbf{m} = p(\mathbf{a} + \mathbf{a}') + q(\mathbf{a} \times \mathbf{a}')$ for any p, q (103)

may be used as the axis of rotation for a → a'. The rotation angle is the angle between the projections
of the vectors a and a' onto the plane m*.
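
The claim of eq(103) can be tested numerically. The sketch below is an illustration under stated assumptions, not the paper's Clifford algebra formulation: vectors are coordinate triples, the rotation itself is applied with Rodrigues' formula, and the helper names (`combine`, `reject`, `rotate`, `axis_and_angle`) are hypothetical. For each choice of p and q it builds m, projects a and a' onto the plane orthogonal to m, takes the signed angle between the projections, and checks that rotating a about m by that angle gives a'.

```python
import math


def dot(u, v):
    return sum(p * q for p, q in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1], u[2] * v[0] - u[0] * v[2], u[0] * v[1] - u[1] * v[0])


def combine(s, u, t, v):
    """Linear combination s*u + t*v of two 3-vectors."""
    return tuple(s * p + t * q for p, q in zip(u, v))


def unit(u):
    return combine(1 / math.sqrt(dot(u, u)), u, 0, u)


def reject(u, n):
    """Part of u lying in the plane orthogonal to the unit vector n."""
    return combine(1, u, -dot(u, n), n)


def rotate(n, angle, u):
    """Rodrigues' formula: rotate u by angle about the unit axis n."""
    first = combine(math.cos(angle), u, math.sin(angle), cross(n, u))
    return combine(1, first, dot(n, u) * (1 - math.cos(angle)), n)


def axis_and_angle(a, a_prime, p, q):
    """Unit axis along m = p(a + a') + q(a x a') and the angle that takes a to a'."""
    m = combine(p, combine(1, a, 1, a_prime), q, cross(a, a_prime))
    n = unit(m)
    c, c_prime = reject(a, n), reject(a_prime, n)
    # Signed angle from c to c' measured about n.
    angle = math.atan2(dot(n, cross(c, c_prime)), dot(c, c_prime))
    return n, angle
```

The choice p = 1, q = 0 gives the half turn about a + a', and p = 0, q = 1 gives the rotation in the aa'-plane; p and q must not both be zero, and a and a' are assumed to have equal length.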



4.7 The general rotation of a rigid object 

Let us now use the 3D Clifford algebra that we have derived in this section, to derive a simple closed 
formula for the rotation of a rigid object, from one known position to another. 

The location and orientation of a rigid body, e.g. a book, in 3-space is given by specifying the
location of three non-collinear points, e.g. A, B, C as in figure 13.

Fig. 12 Rotating about the arbitrary axis m in the (a + b, a × b)-plane by the appropriate angle will take a
to b. The angle needed is obtained by projecting a and b onto the plane m*.

Fig. 13 The position of a book is given by giving the location of any three points A, B, C if A, B, and C are
not collinear.



As a preliminary, observe that the location of a rigid body can be uniquely specified by the location 
of precisely three non-collinear points of the body only because axis systems attached to rigid bodies 
retain their handedness under all those movements that are physically possible. 

If the book is moved then the three points will move to new positions A', B', C' relative to the
coordinate frame of the observer. The problem for which we wish to find a general solution is: given the
initial points A, B, C and the final points A', B', C', and that an arbitrary but known point R has
moved to R', find R'.

The motion can be described as a translation, followed by a rotation, see figure 14. It is by
assumption a rigid body, so the lengths of the lines AB, BC, CA remain unchanged: ||A'B'|| = ||AB||,
||B'C'|| = ||BC|| and ||C'A'|| = ||CA||. We may choose the translation to be specified by the active action
of the line AA', that is by the translation AA', and seek to find the rotation about A'. Let us find this in
terms of the elements of the Clifford algebra with the origin at A'. Let a correspond to the line AB
(strictly a is the equivalence class [AB]), b correspond to AC, a' to A'B' and b' to A'C', as in figure
14.

Fig. 14 The movement of a book from points A, B, C to A', B', C' is described by the translation vector
t = [AA'] and the initial vectors a and b, and final vectors a' and b'.

The rotation that simultaneously rotates a into a' and b into b' is about a line that is in both the
plane (a + a', a × a') and the plane (b + b', b × b'). The first plane is the plane (a − a')*, that is the
set of lines orthogonal to the line (a − a'). The second plane is the set of lines orthogonal to the line
(b − b'). In the general case the line m that we need is therefore

$\mathbf{m} = (\mathbf{a} - \mathbf{a}') \times (\mathbf{b} - \mathbf{b}')$ (104)

and the angle may be found by projecting either a and a', or b and b', onto the plane m* orthogonal
to the line m. Thus

$\mathbf{c} = \mathbf{a}_{\perp m} = \mathbf{a} - (\mathbf{a}\cdot\mathbf{m})\,\mathbf{m}/\mathbf{m}^2$ (105)

and

$\mathbf{c}' = \mathbf{a}'_{\perp m} = \mathbf{a}' - (\mathbf{a}'\cdot\mathbf{m})\,\mathbf{m}/\mathbf{m}^2$ (106)

are suitable vectors. Observe that c, c' ∈ m*, and c × c' is parallel to m. In the special case that
(a − a') is parallel to (b − b'), then m is zero. This corresponds to the rotations a to a' and b to b'
being equal and we can choose c = a and c' = a'.

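Thus the operator that rotates the rigid object so that points A, B, C go to points A' = A, B', C'
is

$\mathrm{ROT}(A, B, C \to A' = A, B', C') = \mathrm{ROT}(\mathbf{a} \to \mathbf{a}', \mathbf{b} \to \mathbf{b}')$
$\quad = \mathrm{ROT}(\mathbf{c} \to \mathbf{c}' \text{ in the } \mathbf{c}\mathbf{c}'\text{-plane})$
$\quad = \mathrm{ROT}(\mathbf{c} \to \mathbf{c}' \text{ about the } \mathbf{m} \text{ axis})$ (107)

where c and c' are given by eqs(105, 106).
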
Combining all these results gives the final expression for the rotation of a general 3D multi-vector
A associated with the rigid object as

$\mathrm{ROT}(A, B, C \to A' = A, B', C')(A) = \mathrm{ROT}(\mathbf{c} \to \mathbf{c}' \text{ about } \mathbf{m})(A)$ (108)

$\quad = \mathbf{d}\mathbf{c}A\mathbf{c}\mathbf{d}/(\mathbf{c}^2 \mathbf{d}^2)$ (109)

where d, the bisection of c and c', is given by

$\mathbf{d} = \mathbf{c}'\,c + \mathbf{c}\,c'$ (110)

with c = ||c|| and c' = ||c'|| the lengths of the two vectors.
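
The recipe of eqs(104 to 108) can be exercised numerically. The following is a sketch under stated assumptions rather than the paper's operator: vectors are coordinate triples relative to A', the final rotation is applied with Rodrigues' formula in place of the product form of eq(109), and all helper names (`combine`, `reject`, `rotate`, `rigid_rotation`) are hypothetical. The special case m = 0 discussed in the text is not handled and raises an error.

```python
import math


def dot(u, v):
    return sum(p * q for p, q in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1], u[2] * v[0] - u[0] * v[2], u[0] * v[1] - u[1] * v[0])


def combine(s, u, t, v):
    """Linear combination s*u + t*v of two 3-vectors."""
    return tuple(s * p + t * q for p, q in zip(u, v))


def unit(u):
    return combine(1 / math.sqrt(dot(u, u)), u, 0, u)


def reject(u, n):
    """Projection of u onto the plane orthogonal to the unit vector n, eqs (105), (106)."""
    return combine(1, u, -dot(u, n), n)


def rotate(n, angle, u):
    """Rodrigues' formula: rotate u by angle about the unit axis n."""
    first = combine(math.cos(angle), u, math.sin(angle), cross(n, u))
    return combine(1, first, dot(n, u) * (1 - math.cos(angle)), n)


def rigid_rotation(a, b, a_prime, b_prime):
    """Unit axis and angle of the rotation taking a -> a' and b -> b'."""
    m = cross(combine(1, a, -1, a_prime), combine(1, b, -1, b_prime))  # eq (104)
    if dot(m, m) < 1e-24:
        raise ValueError("m vanishes: this is the special case treated separately in the text")
    n = unit(m)
    c, c_prime = reject(a, n), reject(a_prime, n)
    if dot(c, c) < 1e-24:  # a lies along m, so project b and b' instead
        c, c_prime = reject(b, n), reject(b_prime, n)
    # Signed angle from c to c' measured about n.
    angle = math.atan2(dot(n, cross(c, c_prime)), dot(c, c_prime))
    return n, angle


if __name__ == "__main__":
    true_axis, true_angle = unit((1.0, 2.0, 3.0)), 0.7
    a, b = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)
    a_prime, b_prime = rotate(true_axis, true_angle, a), rotate(true_axis, true_angle, b)
    n, angle = rigid_rotation(a, b, a_prime, b_prime)
    print(rotate(n, angle, (0.3, -1.2, 2.0)))
```

Once the axis and angle are known, the image R' of any other point R of the body follows from `rotate(n, angle, r)`, where r is the position of R relative to A', followed by the translation.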

4.8 Reference frames 

In earlier sections, and in this section, we showed how to set up an orthonormal coordinate system
for each rigid body. Using this coordinate system we can then measure the location (the position and
orientation) of other rigid bodies relative to that coordinate system. The operations of the Clifford
algebra Cℓ(0, 3) gave the mathematical transformation for the location of an object measured relative to
one rigid body (one coordinate system) to the location of that object measured in any other coordinate
system.

To personalise this in the usual way, the observations of various observers are related to one another
by making the appropriate adjustments (the Galilean transformations) to the positions and orientations
of the observers' coordinate systems. Using the term 'frame of reference': the measurements of one
observer, $S_1$, of the locations of the points, lines, planes and volumes of other rigid bodies, using that
observer's frame of reference, $(\mathbf{x}_1, \mathbf{y}_1, \mathbf{z}_1)$, may be transformed using the operations of the Clifford
algebra Cℓ(0, 3) to the locations of the points, etc., as measured by other observers, $S_2, S_3, \ldots$, using
their various frames of reference, $(\mathbf{x}_i, \mathbf{y}_i, \mathbf{z}_i)$ where i = 2, 3, .... Some of those measurements are the
locations of the frames relative to one another. One set of these measurements consists of the vectors
$\mathbf{a}_{12}$, $\mathbf{b}_{12}$ and $\mathbf{c}_{12}$, being the positions of three points A, B, C (for example the origin $O_2$, and the ends
of the unit lines $O_2X_2$ and $O_2Y_2$) that describe the position of the origin and the orientation of frame
$S_2$ relative to the frame $S_1$. The positions of the origins of two frames are related via a translation.
Their orientations are related via a rotation

$\mathrm{TRANS}(S_2 \to S_1)\,O_2 = O_1$ (111)

$\mathrm{ROT}(S_2 \to S_1)(\mathbf{x}_2, \mathbf{y}_2, \mathbf{z}_2) = (\mathbf{x}_1, \mathbf{y}_1, \mathbf{z}_1)$ (112)

The transformation of the location of (say) rigid body $S_3$ measured in frame $S_2$ as $A_{32}$, to frame $S_1$
(measured as $A_{31}$) is thus first a translation of the origin of frame $S_2$ to frame $S_1$, and then a rotation
of the form given by eq(109) of the previous subsection.

$\mathrm{ROT}(S_2 \to S_1)\,\mathrm{TRANS}(S_2 \to S_1)\,A_{32} = A_{31}$ (113)

Of particular interest, and fundamental importance, is the reciprocity of this relationship. It follows
from the vector space (homogeneity) properties of translations, and the Pythagorean metric (which
includes the isotropy of space). The position of the origin of frame $S_1$ in the frame $S_2$, and the
orientation of frame $S_1$ in the frame $S_2$, are the inverses of the above

$\mathrm{TRANS}(S_2 \to S_1) = \mathrm{TRANS}(S_1 \to S_2)^{-1}$ (114)

$\mathrm{ROT}(S_2 \to S_1) = \mathrm{ROT}(S_1 \to S_2)^{-1}$ (115)

and consequently the inverse of equation (113) is given by

$A_{32} = [\mathrm{ROT}(S_2 \to S_1)\,\mathrm{TRANS}(S_2 \to S_1)]^{-1} A_{31}$ (116)

$\quad = \mathrm{TRANS}(S_2 \to S_1)^{-1}\,\mathrm{ROT}(S_2 \to S_1)^{-1} A_{31}$ (117)

Thus we see that the Clifford algebra Cℓ(0, 3) enables the measurements of observer $S_2$ to be
transformed to those of observer $S_1$, and conversely.
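
The ordering of the inverse operations in eqs(116, 117) can be illustrated with coordinates. This is a minimal sketch with assumptions that are not in the text: points are coordinate triples, the translation is the addition of a fixed vector `t`, the rotation is an orthogonal matrix `rot`, and the function names and sample values are hypothetical.

```python
def matvec(m, p):
    """Apply a 3x3 matrix (nested lists) to a coordinate triple."""
    return tuple(sum(m[r][c] * p[c] for c in range(3)) for r in range(3))


def transpose(m):
    """Transpose, which is the inverse for a rotation matrix."""
    return [[m[c][r] for c in range(3)] for r in range(3)]


def to_frame_1(point_in_2, t, rot):
    """Eq (113): translate first, then rotate."""
    shifted = tuple(p + d for p, d in zip(point_in_2, t))
    return matvec(rot, shifted)


def to_frame_2(point_in_1, t, rot):
    """Eqs (116), (117): undo the rotation first, then undo the translation."""
    unrotated = matvec(transpose(rot), point_in_1)
    return tuple(p - d for p, d in zip(unrotated, t))


if __name__ == "__main__":
    t = (2, -1, 5)                               # hypothetical offset between the origins
    rot = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]     # hypothetical quarter turn, x -> y
    a32 = (3, 4, -2)
    a31 = to_frame_1(a32, t, rot)
    print(a31, to_frame_2(a31, t, rot))
```

Undoing the two steps in the wrong order (translation before rotation) does not in general return the original coordinates, which is why the inverse in eq(117) reverses the order of the factors.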

4.9 Concluding remarks - What have we achieved?

This section has extended the consequences of homogeneity and isotropy from a 2D world to a 3D
world. Homogeneity and the properties of translations of rigid objects led to a rather simple extension
of the addition properties of lines and the corresponding vector space properties. The isotropy of 3D
space, and the rotation properties of rigid objects, led to a richer set of properties, properties that are
described by the Clifford algebra Cℓ(0, 3).

As an example of the power of a coordinate-free formulation of the Clifford algebra, we obtained the
operator that rotates a rigid object from a known position (specified by the location of three points) to
a second position (specified by the new location of these three points). It may be that this result has
not been previously found; certainly the authors have not found such an expression in the literature.

We demonstrated that the maintenance of cyclic structures of sets of basis lines and sets of basis
planes, namely the parity conservation properties of allowable physical movements of rigid bodies,
requires the use of the Clifford algebra Cℓ(0, 3) and not the Clifford algebra Cℓ(3, 0).

Parity conservation in physical 3D space would seem to be a property of homogeneity and isotropy
in nD space. In 1D, rigid objects can be modelled as beads on a wire or trains on tracks, and while they
can be moved (translated) backwards and forwards, they cannot be turned around. To do so would
require turning the object over, using a second dimension of physical space.

In 2D, we modelled rigid objects as sheets of paper on a desktop. The objects can be translated
in two orthogonal directions, represented by the unit vectors x and y, and the expressions −x and −y
make operational sense as independent translations. But a rigid 2D object cannot be rotated so that
only one of a pair of orthogonal lines (say OA and OB) becomes its negative: if we have −OA = AO
then −OB = BO also. To have only one would require turning the paper over, using the third dimension
of physical space. In our 3D space of rigid bodies, there is no evidence of another spatial dimension,
and no evidence that parity of rigid objects is not an absolute conservation law. Note: parity does not
seem to be conserved in some experiments involving the weak force, however the weak force does not
act on rigid bodies. The time parameter adds another dimension to the mathematical description of
our world, but it differs from the spatial dimensions in many ways, as we study in the next section.

Hamilton [24, 25] spent many years seeking a generalization of the algebra of complex numbers that
seemed, via the Argand diagram, to give a good mathematical description of the geometry of the plane.
Complex numbers gave the mathematics describing translations in 2D, as addition of pairs of numbers
for coordinates in the x and y directions. Complex numbers describe rotations by multiplication.
The structure of 3-complexes that he sought does not exist, but he did find the necessary
four-dimensional generalization, which he called the quaternions.

The story of Hamilton recognizing what was needed is part of the oft-quoted folklore of mathematical
discovery. He reports that it came to him "in a flash" while walking with his wife along a Dublin
canal on a Sunday afternoon. The generalization for 3D of the "ordered pair" or 2-complex needed
for 2D geometry was to an "ordered 4-tuple" or "quaternion". The quaternion components are the
basis of Hamilton's non-commutative algebra. Hamilton's quaternion algebra was the first formal
non-commutative algebra, and represents the non-commutativity of rotations in 3D.

Hamilton was en route to developing the appropriate algebra for describing the geometry of space.
In his lectures to the Dublin Royal Society in 1853 [25], Hamilton carefully distinguished between polar
vectors (which he called lines) and axial vectors (which he called versors). Polar vectors describe the
positions of the points of objects, and also describe translations. Axial vectors describe the orientations
of (non-point) objects and also describe rotations. He then proceeded (page 71) to write both polar
vectors and axial vectors in his axial basis, which he labeled by the letters i, j, k. Unfortunately for the
development of the subject he failed to maintain this distinction, and wrote

And I conceive that we may now legitimately, and with advantage, avail ourselves of the same 
analogy, or of the theorem to which it corresponds, to dispense with that symbolic distinction 
which has been above observed, between the three quadrantal versors i, j, k, and the three lines 
i, j, k, which have respectively the directions of their three axes. [Emphasis as in the original.] 

We presume he made this identification so as to keep his algebra small. Had he retained the distinction
he would probably have been led to the conclusions of Clifford [15]. The appropriate algebra has the three
lines i, j, k, which are now called the basis polar vectors and which we write as x, y, z. It also has the three
versors i, j, k, which are now called the basis axial vectors and which we write as i, j, and k. The complete
algebra closes with the addition of two more basis elements, the scalar, 1, and an element we represent
as v which relates to a basis volume element. Thus to describe 3D geometry accurately we need to use
the eight-dimensional algebra which we label $C\ell(0,3)$.

Hamilton proved that the axial vectors i, j and k square to -1, and that they form his famous
quaternion algebra

$$i^2 = j^2 = k^2 = ijk = -1$$

However, the incorrect identification between the versors i, j, k and the lines i, j, k continues in the
labeling, by many physics texts, of the polar vector basis elements as i, j, and k where i × j = k.
This product is correct for axial vectors but not for polar vectors. This unfortunate identification by
Hamilton has to be patched up by ignoring the distinction of polar and axial vectors, or equivalently
by identifying lines with planes (or translations with rotations). Put simply, in most approaches since
Hamilton a plane is identified with the line that is normal to it.
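
The distinction between the two candidate algebras can be checked mechanically. The sketch below is an illustration only, not the authors' construction: it assumes the identification i = yz, j = zx, k = xy of the versors with products of lines (the spatial bi-vectors listed later in the paper), and it stores multivectors as Python dictionaries keyed by bitmasks. The helper names `blade_product`, `gp` and `hamilton` are hypothetical.

```python
from itertools import product

def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

def gp(A, B, sig):
    """Geometric product of multivectors stored as {blade bitmask: coefficient}."""
    out = {}
    for (a, x), (b, y) in product(A.items(), B.items()):
        s, c = blade_product(a, b, sig)
        out[c] = out.get(c, 0) + s * x * y
    return {k: v for k, v in out.items() if v != 0}

X, Y, Z = {1: 1}, {2: 1}, {4: 1}  # basis lines x, y, z on bits 0, 1, 2

def hamilton(sig):
    """Return ([i*i, j*j, k*k], i*j*k) for the assumed i = yz, j = zx, k = xy."""
    i, j, k = gp(Y, Z, sig), gp(Z, X, sig), gp(X, Y, sig)
    squares = [gp(b, b, sig) for b in (i, j, k)]
    return squares, gp(gp(i, j, sig), k, sig)

cl03 = hamilton((-1, -1, -1))  # lines square to -1
cl30 = hamilton((+1, +1, +1))  # lines square to +1
print("Cl(0,3):", cl03)
print("Cl(3,0):", cl30)
```

Under these assumptions the three versors square to -1 in both signatures, but the product ijk equals -1 only when the lines square to -1; with lines squaring to +1 it comes out as +1. The key `0` in the dictionaries denotes the scalar part.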

This conflation of lines (or unit vectors) x, y, z and planes (or unit rotators) i, j, k continues to this
day. Simon Altmann [33] gives the fullest history of this mess that we are aware of. We recommend
that readers who wish to pursue some of the history of the geometric product and Clifford algebras
refer to the review by Altmann [26], and that they also read the lectures by Hamilton [25].

A further consequence of the polar-axial identification is that the vector algebra is too small to
describe the geometry and the physics contained in that geometry. This evidences itself in many ways.
The first we have discussed above: the usual 3D algebra needs complex
numbers, effectively a six-dimensional space, to describe rotations. The eight-dimensional Clifford
algebra contains all we need without complex numbers. The second is the so-called proof that quantum
mechanics needs complex numbers. In a future paper we plan to review the argument as presented by
Sakurai [27], to conclude that yes, you do need more than a three-dimensional algebra, but no, complex
numbers are not needed if both polar and axial vectors are used. Our argument is, in brief, that the
complex number basis of $\mathbb{C}^3$, namely x, ix, y, iy, z, iz, can be used for some of the geometry of 3D, if used with
care, but the basis x, i, y, j, z, k is a better description of the geometry of the physical 3D world.
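
As a small illustration of rotations carried out with real coefficients only, the following sketch rotates the line x into the line y inside the eight-dimensional algebra with lines squaring to -1. It is a sketch under assumptions: the rotation is written as R v R^{-1} with R = cos(θ/2) + sin(θ/2) xy, a common convention that is not necessarily the authors' own, and the sense of rotation depends on that choice. The helper names are hypothetical.

```python
import math
from itertools import product

def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

def gp(A, B, sig):
    """Geometric product of multivectors stored as {blade bitmask: coefficient}."""
    out = {}
    for (a, x), (b, y) in product(A.items(), B.items()):
        s, c = blade_product(a, b, sig)
        out[c] = out.get(c, 0) + s * x * y
    return {k: v for k, v in out.items() if v != 0}

SIG = (-1, -1, -1)  # x, y, z (bits 0, 1, 2) each square to -1

def rotate_in_xy(vec, theta):
    """Rotate a vector by theta in the xy plane using only real coefficients."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    R, R_inv = {0: c, 3: s}, {0: c, 3: -s}  # blade 3 is the plane element xy
    return gp(gp(R, vec, SIG), R_inv, SIG)

rotated = rotate_in_xy({1: 1.0}, math.pi / 2)  # quarter turn applied to x
print(rotated)
```

The result has a y component (key `2`) equal to one within rounding and an x component (key `1`) that vanishes within rounding. No imaginary unit appears anywhere in the calculation; the plane element xy plays the role that i plays in the complex plane.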

5 Time and the Speed of Light, the Algebra of Spacetime 

If the assumptions and axioms up till now are accepted, then their extension to spacetime seems
trivial. It was easy to extend homogeneity (section [2]) and isotropy (section [3]) from 2D to 3D (section
[4]). However, time is different from space in many ways. Our preceding analysis was based on the
translational and rotational invariance of rigid bodies. Clocks are not rigid bodies, although most
definitions of a clock rely in part on rigid bodies: the swing of a pendulum, the oscillation of a crystal,
the bouncing of light between mirrors held a fixed distance apart.

The translational and rotational properties of rigid bodies, and of reference frames defined by rigid
bodies, led us to the conclusion that the Clifford algebra $C\ell(0,3)$ over the field of rational numbers is the
mathematical structure needed to transform measurements of the position and orientation of rigid
bodies from one inertial reference frame to another. It may seem that extending the arguments to spacetime would
be as trivial as the extension from 2D to 3D space. Rather, the authors have found this section the
most difficult, both to understand what we wish to write and also to write it clearly. The reason for this
would seem to lie in the fact that the conventional derivations of the Lorentz and Poincaré transformations
assume more than is needed. Correspondingly, the literature is full of paradoxes and seemingly
unsolved problems in special relativity; see for example the collections of papers in the conference
proceedings [28] or papers by Selleri [29,30]. Some, like the twin paradox, rely on confusion between
an inertial object (the Earth-bound twin) and an accelerated one (the travelling twin). Some argue that paradoxes can
only be resolved by considering that the synchronisation of separated clocks is related to the one-way
speed of light. In this section we retain the homogeneity and isotropy of space and embed this into
an assumption about the one-way speed of light. By making the minimal assumptions about these
matters, this section aims to come to the strongest conclusions about spacetime transformations. We
conclude that the algebra to describe them is the Clifford algebra $C\ell(1,3)$.

The subject of this section is to extend the work of the previous three sections to include time,
not as a mere parameter, but as a fourth dimension. In particular we seek to understand how to
mathematically describe the motion of rigid bodies in spacetime. In general, the motion of a rigid
body in spacetime can be described by a sequence of events. The term 'event' is the generalisation
to spacetime of the term 'point' of space from the previous sections. Saying 'event' is an alternative
to saying 'point in spacetime.' As we shall see, spacetime needs exactly four linearly independent
parameters to specify position, up from the three needed for 3D space. In Newton's First Law, the
sequence of events is the set of locations of the points on (or in) rigid bodies in space, parameterised
by time.

In sections [2] and [3] of this series we considered observations of the movement of rigid objects in a
2D toy world. The rigid objects in these sections were sheets of paper, moving about on top of another 
rigid object, the desktop. The points, lines, and the pieces of paper could be described as being at 
particular coordinates on the desktop at different values of a third parameter. We had the choice 
between two options for this third parameter to describe the location of the objects as they moved. 
We could plot 2D position against values of a time parameter, or against values of a height parameter. 
However neither the parameter 'time', nor the parameter 'height' needed any scale for that discussion. 
All that was needed was some means of characterizing the sequence of the positions of one rigid body 
relative to another. When extending these arguments to 3D in section [3] we had only the parameter
'time' to characterise the various positions of the book or other 3D objects. Once again though, no 
scale was attributed to the parameter. 

The first task in this section is to discuss the concept of equal times, and to associate a scale to the 
time parameter by developing a time measuring stick (known as a 'clock'). The means to do this is to 
use natural systems that provide clocks, so subsection 5.1 discusses such natural systems and shows that we may
treat distances in the time direction (time intervals) in similar ways that we treat distance in any one 
of the three linearly independent space directions. The invariance properties of clocks, that is the fact 
that many of our world's clocks behave in the same way yesterday, today and tomorrow, corresponds 
strongly with the homogeneity of space - rigid bodies do not change when moved in space. There is 
however an important difference between translations in time and translations in space. While we can 
move our rigid objects back and forth in space (within the limits imposed by our experiments), we 
cannot move our rigid objects back and forth in time. 

It may help the reader to use the term 'timeline' here. Timelines do not always have a scale attached;
they often just show the time ordering of events. Clocks and their invariance, however, always allow us to
attach a scale. This invariance property allows us to define corresponding algebraic entities, vectors in
the one dimensional vector space that is the time direction of spacetime, and also the unit vector for 
the time direction, t. 

Einstein [31] caused a major revision to the way we view time and revised our view of simultaneity. 
Einstein showed that we have a choice of assuming that clocks measure the same time intervals for all 
inertial observers, or that the measured speed of light is the same for all inertial observers. Experiment 
shows it is the second option that is correct, at least for rigid body frames. This changes the way 
we interpret what we observe, in particular what we consider simultaneous. Our second key task is 
therefore to explore some consequences of the fact that the speed of light in a vacuum is the same for 
all inertial rigid body observers. 

The relationship of the time axis to the space axes is given by the Lorentz metric, just as the 
relationship of the space axes to each other is given by the Pythagorean metric. The Lorentz metric is 
shown to follow from the invariance of the speed of light, just as the Pythagorean metric was shown 
in sections [3] and [4] to follow from the isotropy of space, being the invariance of rigid bodies under 
rotation. 

The previous sections were limited to exploring the geometric consequences of the concept of straight
line motion of a rigid body as used in Newton's First Law. We have shown that the homogeneity and
isotropy of our 3D space are well described by the Clifford algebra $C\ell(0,3)$. However this is only part
of the statement of Newton's First Law. Not only does the absence of forces lead to straight line
motion (which we have seen is not a simple concept), but the motion has constant speed, or in other
words is 'uniform.' This section looks at how to extend the Clifford algebra $C\ell(0,3)$ to incorporate our
knowledge of the relationship of the time parameter to the three spatial parameters.

We find that the 'distance' between events in spacetime is given by Lorentz' generalisation of
Pythagoras' result. Section [4] used the isotropy of space to compare the length of measuring sticks in
different directions, and thereby to choose unit vectors to be of the same length, $\|\hat{\mathbf{x}}\| = \|\hat{\mathbf{y}}\| = \|\hat{\mathbf{z}}\|$. The
concept of 'distance' in spacetime arising from the constancy of the speed of light enables us to expand
the four-dimensional vector space to an associative algebra of dimension $2^4 = 16$. This algebra describes
much more general transformations than the translations described by a vector space. We explore this
algebra, the Clifford algebra $C\ell(1,3)$, in subsection 5.5 and find that it contains the transformations
of Lorentz and Poincaré [32], many of which are transformations between measurements of different
observers, and a few of which describe physical operations on rigid bodies or on clocks.



5.1 Clocks and the uniformity of time 

The previous sections have made much use of the homogeneity of space and the concept of a rigid body 
to define measuring sticks to measure length. The measuring sticks can be translated (and with isotropy, 
rotated) to compare lengths of objects. Time is rather different, as before the invention of the fob watch
in the sixteenth century and later the wrist watch, we could not 'pick up our time' and compare it
with someone else's. Human use and understanding of time was based on the day-night, lunar and
yearly cycles. All time measurements shorter than a day were subject to considerable variability and 
inaccuracy, and more or less unrepeatable. Excepting sun dials, water clocks and swinging chandeliers, 
and of course one's pulse or heart beat, time was difficult to measure. Prior to mechanical clocks there 
was nothing for measuring time that was analogous to the rigid objects that provide reproducible 
measuring sticks for space. 

Today it is quite different. Standard wrist watches have an accuracy of better than a few seconds per
day, a few parts in $10^5$ or so, and we are used to computer clock frequencies of gigahertz, not only the
0.2 to 2 hertz of a chandelier or one's heart. Time is now the accurate measurement, defined by the
period associated with the cesium atomic clock. Distance is now defined by the wavelength of light, as
the ratio of the (assumed, but well tested) constant speed of light to the frequency of light emitted
by the appropriate atoms.

Modern clocks, such as used for laboratory measurements, are based on atomic phenomena which
typically have a 'tick' of $10^{-15}$ s and an accuracy of around $1 : 10^{-20}$. Such short times are beyond
the comprehension of the proverbial 'person in the street', who is perhaps limited to a minimum time
interval, $t_{\min}$, of a millisecond, $10^{-3}$ s. However, high energy physics experimentalists are familiar with
particle lifetimes as short as $t_{\min} = 10^{-24}$ s. Likewise a child's perception of a longest time interval,
$t_{\max}$, is a few years, or $10^{7}$ s. Cosmologists have their $t_{\max}$ as the age of the Universe, $t_{\rm universe} \sim 10^{17}$ s.
Physicists thus consider the ratio $t_{\max}/t_{\min}$ of about $10^{41}$.
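
The orders of magnitude quoted above can be reproduced with a few lines of arithmetic. This is only a rough check: the figure of 13.8 billion years for the age of the Universe is an assumption brought in for the illustration and does not appear in the text, and the variable names are hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# A watch that drifts by one second per day, expressed as a fraction.
watch_error = 1 / SECONDS_PER_DAY

# Assumed age of the Universe (not taken from the text), converted to seconds.
AGE_OF_UNIVERSE_YEARS = 13.8e9
t_max = AGE_OF_UNIVERSE_YEARS * 365.25 * SECONDS_PER_DAY

# Shortest particle lifetime quoted in the text, in seconds.
t_min = 1e-24

ratio = t_max / t_min
print(f"watch error ~ {watch_error:.1e}")
print(f"t_max ~ {t_max:.1e} s, ratio ~ {ratio:.1e}")
```

One second per day is about one part in $10^5$, and the ratio of the longest to the shortest time scale lands within the decade starting at $10^{41}$.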

The invariance properties of clocks, that is the fact that many of our world's clocks behave in the 
same way yesterday, today and tomorrow, corresponds strongly with the homogeneity of space - rigid 
bodies do not change when moved in space. It is of course an assumption that clocks will behave the 
same way tomorrow as they did yesterday, but it is an assumption that can be tested in 24 hours time. 
We have good records of how clocks behaved in the recent past, perhaps for several hundred years, 
and indirect evidence that (many) clocks have not changed for billions of years. 



5.2 Time as an axis of 4D spacetime 

Although our perception of time is perhaps controlled by internal clocks in the body, such as the
beating heart and electrical and biochemical processes in our brain, it is clear that time has many
similarities to position. It is reasonable therefore to treat time as a fourth coordinate, that is to treat
spacetime as a four-dimensional vector space. We can compare the length of time intervals in our
laboratory by first choosing an origin O and consequently choosing our 'time measuring stick', a clock
with a 'tick', OT, starting from the origin O, of length 1 s, which in turn is calibrated by our cesium
atomic clock. The time coordinate is to be regarded as a 'coordinate', 'parameter' or 'direction' that is
linearly independent of the three space 'coordinates', 'parameters' or 'directions'. We add to the three
orthonormal spatial measuring sticks (OX, OY, OZ) and the three orthonormal unit vectors ($\hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}}$)
the time measuring stick OT and the unit vector $\hat{\mathbf{t}}$. Just as $\hat{\mathbf{x}}$ is the class of all lines equivalent by
space translation to the unit spatial measuring stick, OX, that is $\hat{\mathbf{x}}$ = [OX], so is $\hat{\mathbf{t}}$ the class of all
time intervals equivalent by time translation to the unit time interval, OT, namely $\hat{\mathbf{t}}$ = [OT]. There
is however an important difference between translations in time and translations in space. While we
can move our rigid objects back and forth in space (within the limits imposed by our experiments),
we cannot move back and forth in time. The reason for this brings us back to the concluding remarks
of the previous section. Just as the handedness (parity) of rigid bodies in space is conserved, so the
handedness of time must be conserved also. Because we have only a single time dimension, the direction
of time cannot be reversed via any physical operation. Thus both the handedness of space and the
handedness of time for all rigid bodies are conserved by all physically allowed operations. An event a in
spacetime can now be labelled in terms of the four basis vectors ($\hat{\mathbf{t}}, \hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}}$) describing the coordinates
of the event relative to the origin of the reference frame, or describing translations of elements of the
linear space.

It seems now an appropriate time to introduce the usual notation for unit vectors in special and
general relativity, and in the study of Clifford algebras. The initial letter of the German word for the
number one, eins, is a common choice. The four basis vectors are

$$e_0 = \hat{\mathbf{t}}, \quad e_1 = \hat{\mathbf{x}}, \quad e_2 = \hat{\mathbf{y}}, \quad e_3 = \hat{\mathbf{z}} \qquad (118)$$

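which we index by Greek letters taking the values 0, 1, 2, 3; as usual, we use Latin letters for the spatial
indices 1, 2, 3. An event a measured in (coordinatised in) frame $S_1$ is thus

$$\begin{aligned}
a &= a_0 \hat{\mathbf{t}} + a_1 \hat{\mathbf{x}} + a_2 \hat{\mathbf{y}} + a_3 \hat{\mathbf{z}} \\
  &= a_0 e_0 + a_1 e_1 + a_2 e_2 + a_3 e_3 \\
  &= a_0 e_0 + a_i e_i \\
  &= a_\mu e_\mu
\end{aligned} \qquad (119)$$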

following the usual convention of using non-bold Latin font for 4-vectors (vectors in spacetime). An
exception to this is that for the basis vectors $\hat{\mathbf{t}}, \hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}}$ we retain the 3D bold-hat notation. In the
above, and in the following, we use the usual 'Einstein summation convention' whereby doubled indices
are summed over. However, because we are using a Clifford algebra, we do not need to use raised and
lowered indices to take into account the metric: the metric is built into the basis vectors.

The time measuring stick is linearly independent of the three space measuring sticks, and we now
have the concepts to fully understand Newton's First Law (see our discussion of this matter in section
[2]). But how do we determine whether or not the time measuring stick is orthogonal to the spatial
ones? Indeed, what does it mean for time to be orthogonal to space, since we cannot rotate space
into time? Prior to 1905 time and space were seen as independent, but Einstein showed that there
was a way to generalise the isotropy considerations of section [3]. He found a way to interpret certain
measurements as 'rotations' in spacetime, and thus a way to find a spacetime distance measure that
was the generalisation of Pythagoras. A modern variant of his argument can be found in introductory
physics texts. We review this in subsection 5.4. We offer a simpler derivation and explore the result in
detail in subsection 5.3.



5.3 Relative speed of rigid bodies 

We want to use an inertial rigid body (and its clocks) to define a frame and measure other rigid bodies
with respect to this reference frame. In general these other rigid bodies can be moving with uniform
velocity, undergoing linear acceleration, or rotating. For the cases where there is any acceleration,
either linear or angular, the associated forces acting on the rigid body will have to be considered.
Particularly in the case of rotating rigid bodies, centrifugal (and Coriolis) forces arise. Although we
have set up the mathematics to deal with rotating frames and accelerating frames, in this section we
are concerned only with inertial frames. Whereas frames are allowed to be moving at a constant speed
and two frames are allowed to be rotated with respect to each other, that is have different orientations,
we do not allow a frame to be accelerating or rotating. Such a frame would not be an inertial frame.
In future work we plan to use the Clifford algebra to account for any accelerations that are being
experienced by observers. This will allow us to deduce the extension of the properties of rigid bodies
and clocks to accelerated motion. We further anticipate that this future work will resolve many of the
paradoxes, such as the twin paradox, and what many [28] regard as open problems in special relativity.

The speed, $v_{21}$, of rigid body $S_2$ relative to rigid body $S_1$, is defined as the ratio of distance traveled,
$\ell_1$, as measured by $S_1$, to the travel time, $t_1$, also as measured by $S_1$. We have

$$v_{21} = \ell_1 / t_1 \qquad (120)$$

If $S_2$ is in uniform motion relative to $S_1$, then if $S_1$ takes another measurement $v'_{21} = \ell'_1 / t'_1$, we find

$$v'_{21} = v_{21} \qquad (121)$$

In addition to rigid bodies moving uniformly with respect to each other, the orientations of their
frames may also be rotated with respect to one another. As we have noted before, the isotropy of space
allows the rotation of a body, here $S_2$, relative to the axis system of another, $S_1$, to be written in
terms of the product operation between vectors. In section [4] we showed how to derive the plane of the
rotation given the initial and final positions of three points of a rigid body. That calculation gave the
rotation plane, and the angle in that plane, as a bi-vector.

5.4 The speed of light and the Clifford algebra $C\ell(1,3)$

Consider now a short pulse of light emitted at event $E_1 = (t_1 \hat{\mathbf{t}} + x_1 \hat{\mathbf{x}} + y_1 \hat{\mathbf{y}} + z_1 \hat{\mathbf{z}})$ and received at
event $E_2 = (t_2 \hat{\mathbf{t}} + x_2 \hat{\mathbf{x}} + y_2 \hat{\mathbf{y}} + z_2 \hat{\mathbf{z}})$, both measured in the frame of a rigid body $S_1$ in inertial motion.
Extending the definition, eq (120), of relative speeds of rigid bodies to the speed of light, c, gives for
this situation

$$c = \ell_{21} / (t_2 - t_1) \qquad (122)$$

where $\ell_{21}$ is given by Pythagoras

$$\ell_{21} = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2} \qquad (123)$$

Eliminating $\ell_{21}$ and rearranging gives

$$c^2 (t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 = 0 \qquad (124)$$

It is experimentally observed that in a frame $S_1$, all measurements of c give the same value,
independently of the location of the emission and absorption events. Furthermore, if these two events
are observed by observers in the frame of another rigid body $S_2$, then the same result holds, for the
same value of c, regardless of the position, orientation or speed of the two rigid bodies relative to
each other. In other words eq (124) is the generalisation of Pythagoras to spacetime distances between
events connected by light.
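
A numerical illustration of eq (124) may help. The sketch below builds two events connected by a light pulse, evaluates the left-hand side of eq (124), and repeats the evaluation after passing both events through the standard Lorentz boost along x. The boost formula is assumed here as a known result; it is not derived at this point in the text. Exact rational arithmetic is used so that the zeros are exact. The event coordinates, the direction of the pulse, and the function names are arbitrary choices made for the illustration.

```python
from fractions import Fraction as F

C = F(299_792_458)  # speed of light in m/s

def interval(e1, e2):
    """Left-hand side of eq (124) for two events given as (t, x, y, z)."""
    dt, dx, dy, dz = (b - a for a, b in zip(e1, e2))
    return C**2 * dt**2 - dx**2 - dy**2 - dz**2

def boost_x(event, beta, gamma):
    """Standard Lorentz boost along x (assumed, not derived here)."""
    t, x, y, z = event
    v = beta * C
    return (gamma * (t - v * x / C**2), gamma * (x - v * t), y, z)

beta, gamma = F(3, 5), F(5, 4)
assert gamma**2 * (1 - beta**2) == 1  # gamma is exact for this beta

E1 = (F(2), F(10), F(-4), F(7))              # emission event
T = F(1, 1000)                               # travel time of the pulse
n = (F(3, 13), F(4, 13), F(12, 13))          # unit direction of travel
E2 = (E1[0] + T,) + tuple(p + C * T * k for p, k in zip(E1[1:], n))

same_frame = interval(E1, E2)
moving_frame = interval(boost_x(E1, beta, gamma), boost_x(E2, beta, gamma))
print(same_frame, moving_frame)
```

Both values are exactly zero, which is the content of the statement that the same value of c is found in the frames of both rigid bodies.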

The speed of light is rather different from the speed of a rigid body. Although Einstein reports [55]
that he found the thought experiment of imagining that he was traveling with a light wave to be a key
step in coming to his Special Theory of Relativity, it is experimentally observed that no rigid body
ever travels at the speed of light relative to another rigid body. Thus we cannot define the motion of
a rigid body relative to light, only the speed of light relative to a rigid body. The transformation laws
between measurements made using clocks and rigid bodies, derived in the next subsection, explain this,
in the sense that we would have to modify at least one of the assumptions and at least one of the
axioms of our sections if we were ever to observe rigid bodies traveling at the speed of light.

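In order to find the algebraic product relationship between the basis vectors, we can repeat the
argument from section II [3]. We want the free product

$$\left( c(t_2 - t_1)\hat{\mathbf{t}} + (x_2 - x_1)\hat{\mathbf{x}} + (y_2 - y_1)\hat{\mathbf{y}} + (z_2 - z_1)\hat{\mathbf{z}} \right)^2$$

to reproduce eq (124). After a few simple steps analogous to the 2D and 3D cases, we deduce that we
need, in addition to the product rules for the spatial unit vectors, eqs (2-6) of section III [4], the rules

$$\hat{\mathbf{t}}^2 = 1, \qquad \hat{\mathbf{t}}\hat{\mathbf{x}} = -\hat{\mathbf{x}}\hat{\mathbf{t}}, \qquad \hat{\mathbf{t}}\hat{\mathbf{y}} = -\hat{\mathbf{y}}\hat{\mathbf{t}}, \qquad \hat{\mathbf{t}}\hat{\mathbf{z}} = -\hat{\mathbf{z}}\hat{\mathbf{t}} \qquad (125)$$

The rules eqs (2-6) of III, together with eqs (125), define the Clifford algebra $C\ell(1,3)$.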
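
The product rules can be tabulated by machine. In the sketch below, which is an illustration rather than the authors' procedure, the four basis vectors are encoded as single bits and multiplied with a small blade-product routine; the signature (+1, -1, -1, -1) is the assumption that encodes the time vector squaring to +1 and the spatial vectors squaring to -1. The function names are hypothetical.

```python
def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

SIG = (1, -1, -1, -1)  # squares of t, x, y, z on bits 0, 1, 2, 3

def anticommutator(m, n):
    """Coefficient of e_m e_n + e_n e_m (both terms lie on the same blade)."""
    s1, b1 = blade_product(1 << m, 1 << n, SIG)
    s2, b2 = blade_product(1 << n, 1 << m, SIG)
    assert b1 == b2
    return s1 + s2

table = [[anticommutator(m, n) for n in range(4)] for m in range(4)]
for row in table:
    print(row)
```

The table is diagonal with entries 2, -2, -2, -2: distinct basis vectors anticommute, and the diagonal is twice the Lorentz metric. In this sense the metric is carried by the basis vectors themselves.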






The Clifford algebra of spacetime $C\ell(1,3)$ has other basis elements, which are constructed from
products of the defining elements of eq (118). The extra elements we write as

$$\begin{aligned}
e_{\mu\nu} &= e_\mu e_\nu = -e_{\nu\mu} \quad (\mu \neq \nu) \\
e_{\mu\nu\rho} &= e_\mu e_\nu e_\rho = e_{\nu\rho\mu} \text{ and other cyclic permutations of } \mu\nu\rho \\
 &= -e_{\nu\mu\rho} \text{ and other non-cyclic permutations} \\
e &= e_0 e_1 e_2 e_3 = e_{0123} \text{ and even permutations} \\
 &= -e_{1023} \text{ and other odd permutations}
\end{aligned} \qquad (126)$$

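Using this notation we may readily expand out the square, $a^2$, of the vector a that represents the
distance between two events as

$$\begin{aligned}
a^2 &= (a_0 e_0 + a_1 e_1 + a_2 e_2 + a_3 e_3)^2 \\
    &= a_0^2 e_0^2 + (a_i e_i)^2 + a_0 a_i e_{0i} + a_i a_0 e_{i0} \\
    &= a_0^2 - a_1^2 - a_2^2 - a_3^2
\end{aligned} \qquad (127)$$

since $e_{0i} = -e_{i0}$, $e_0^2 = 1$, and the spatial rules give $(a_i e_i)^2 = -(a_1^2 + a_2^2 + a_3^2)$. By choosing
$e_0 = \hat{\mathbf{t}}$, the parameter $a_0$ in the first of eqs (119) is related to the time coordinate t by the scale factor c, $a_0 = ct$.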
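
The cancellation of the cross terms in eq (127) can be confirmed with integer components. This is a minimal sketch, assuming the bitmask encoding described in the code comments; the function names are hypothetical and the sample components are arbitrary.

```python
from itertools import product

def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

def gp(A, B, sig):
    """Geometric product of multivectors stored as {blade bitmask: coefficient}."""
    out = {}
    for (a, x), (b, y) in product(A.items(), B.items()):
        s, c = blade_product(a, b, sig)
        out[c] = out.get(c, 0) + s * x * y
    return {k: v for k, v in out.items() if v != 0}

SIG = (1, -1, -1, -1)  # e0 squares to +1; e1, e2, e3 square to -1

def square(a0, a1, a2, a3):
    """Square of a0 e0 + a1 e1 + a2 e2 + a3 e3 under the geometric product."""
    a = {1: a0, 2: a1, 4: a2, 8: a3}
    return gp(a, a, SIG)

timelike = square(5, 1, 2, 3)     # 25 - 1 - 4 - 9
lightlike = square(13, 3, 4, 12)  # 169 - 9 - 16 - 144
print(timelike, lightlike)
```

Only the scalar part (key `0`) survives in the first case, with value 11. In the second case every term cancels and the result is the empty dictionary, the representation of zero used here, as expected for a separation between events connected by light.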

Observe that the spacetime Clifford algebra $C\ell(1,3)$ contains the 16 linearly independent elements

the scalar, 1
the 4 vectors, $e_\mu = e_0, e_1, e_2, e_3 = \hat{\mathbf{t}}, \hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}}$
the 3 spatial bi-vectors, $e_{ij} = e_{23}, e_{31}, e_{12} = i, j, k$
the 3 spacetime bi-vectors, $e_{i0} = e_{10}, e_{20}, e_{30}$
the 4 tri-vectors, $e_{\mu\nu\rho} = e_{123}, e_{023}, e_{031}, e_{012}$
the quadri-vector or spacetime pseudoscalar, $e = e_{0123}$ (128)

where the six elements 1, $e_0$, $e_{10}$, $e_{20}$, $e_{30}$, and $e_{123}$ square to +1 and the ten elements $e_1$, $e_2$, $e_3$, $e_{23}$, $e_{31}$,
$e_{12}$, $e_{023}$, $e_{031}$, $e_{012}$ and $e$ square to -1. The finite group $C\ell_{\rm group}(1,3)$ consists of 32 elements, these 16
elements and their negatives.
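
The count of six elements squaring to +1 and ten squaring to -1 can be verified by enumerating all sixteen basis blades. The sketch below labels each blade by the indices of its factors in increasing order, so that for instance `e01` stands for $e_0 e_1$, which differs from the $e_{10}$ of the list above only by a sign and therefore has the same square. The encoding and the function names are assumptions made for this illustration.

```python
def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

SIG = (1, -1, -1, -1)  # e0 squares to +1; e1, e2, e3 square to -1

def name(blade):
    """Label a blade by the indices of its factors, e.g. bitmask 14 -> 'e123'."""
    idx = "".join(str(m) for m in range(4) if (blade >> m) & 1)
    return "e" + idx if idx else "1"

plus = [name(b) for b in range(16) if blade_product(b, b, SIG)[0] == +1]
minus = [name(b) for b in range(16) if blade_product(b, b, SIG)[0] == -1]
print("square to +1:", plus)
print("square to -1:", minus)
```

The first list contains the scalar, the time vector, the three spacetime bi-vectors and the spatial tri-vector; the remaining ten elements, including the pseudoscalar, fall in the second list.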

We have used the invariance of the speed of light, both with respect to measurements in the frame
of one inertial rigid body, and with respect to measurements in differing inertial rigid bodies, to deduce
the Lorentz invariant metric of spacetime. This metric is the generalisation of the Pythagoras result for
orthonormal axes x, y and z to include t. Pythagoras says that the length squared of the line on a rigid
body in 3D is the sum of the squares of the components with respect to orthonormal reference axes,
and is invariant under motion of the rigid body. This encapsulates the homogeneity and isotropy of
3D space. The Lorentz metric extends Pythagoras from 3D to the 4D of spacetime by giving a precise
meaning to the statement that the time axis of an inertial rigid body is orthogonal to all three space
axes. Further, the Lorentz metric uses the speed of light, c, as the constant relating the length of the
space measuring sticks (chosen to be 1 metre) to the time measuring stick (chosen to be 1 second).
Although we have not chosen units where c = 1, we have chosen the vector $\hat{\mathbf{t}}$ to be unit, $\hat{\mathbf{t}}^2 = 1$.



5.5 Spacetime events and simultaneity 

We employ the techniques introduced in section [4] for transforming between reference frames in a
relativistic setting and obtain the Lorentz and Poincaré transformations as a result. We begin by
considering two frames $S_1$ and $S_2$ representing two inertial rigid body observers. For simplicity we
assume that the two frames are not rotated with respect to one another and that initially, at t = 0, the
origins of the two frames coincide, that is $O_1 = O_2$. By this we mean that the spacetime coordinates
of the initial event are the same in both frames

$$S_1 = (0, 0, 0, 0) = 0\hat{\mathbf{t}} + 0\hat{\mathbf{x}} + 0\hat{\mathbf{y}} + 0\hat{\mathbf{z}} \qquad (129)$$

$$S_2 = (0, 0, 0, 0)' = 0\hat{\mathbf{t}}' + 0\hat{\mathbf{x}}' + 0\hat{\mathbf{y}}' + 0\hat{\mathbf{z}}' \qquad (130)$$

We consider now how the coordinates change in the two frames as $S_2$ moves with respect to $S_1$ in
the x direction with speed v. The position of $S_1$ as measured by observers in $S_1$ is

$$S_1(S_1) = ct\,\hat{\mathbf{t}} = s_1 \qquad (131)$$

because observers in $S_1$ see their clocks ticking. Similarly, the position of $S_2$ as measured by observers
in $S_1$ is

$$S_1(S_2) = ct\,\hat{\mathbf{t}} + vt\,\hat{\mathbf{x}} = s_2 \qquad (132)$$

Therefore the transformation from the first frame $S_1$ to the second frame $S_2$ is given by a spacetime
rotation Rot($S_1 \to S_2$)

$$\mathrm{Rot}(S_1 \to S_2) = \mathrm{Rot}(ct\,\hat{\mathbf{t}} \to ct\,\hat{\mathbf{t}} + vt\,\hat{\mathbf{x}}) = \mathrm{Rot}(s_1 \to s_2) \qquad (133)$$

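Now consider a general spacetime event P measured by $S_1$ and $S_2$. The coordinates of P are given
by

$$P = ct\,\hat{\mathbf{t}} + x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}} \qquad (134)$$

$$\phantom{P} = ct'\,\hat{\mathbf{t}}' + x'\,\hat{\mathbf{x}}' + y'\,\hat{\mathbf{y}}' + z'\,\hat{\mathbf{z}}' \qquad (135)$$

respectively. Given the coordinates of P in the first frame $S_1$, the coordinates in the second frame $S_2$
are given by

$$S_2(P) = \mathrm{Rot}(S_1 \to S_2)(S_1(P)) \qquad (136)$$

$$\phantom{S_2(P)} = (\hat{s}_1 \hat{s}_2)^{1/2}\, S_1(P)\, (\hat{s}_1 \hat{s}_2)^{-1/2} \qquad (137)$$

$$\phantom{S_2(P)} = \frac{\hat{s}_1 (\hat{s}_1 + \hat{s}_2)\, S_1(P)\, (\hat{s}_1 + \hat{s}_2)\, \hat{s}_1}{(\hat{s}_1 + \hat{s}_2)^2} \qquad (138)$$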

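where we have used the expression (38) of section [3] for the square root in terms of the unit vectors $\hat{s}_1 = \hat{\mathbf{t}}$
and $\hat{s}_2 = \gamma(\hat{\mathbf{t}} + \beta\hat{\mathbf{x}})$, where $\beta = v/c$ and $\gamma = 1/\sqrt{1 - v^2/c^2}$. This expression can be evaluated directly,
but the algebra is simplified by noting that for rotations in the tx plane, the y and z components are
unchanged. Using the generalization of equation (29) of the earlier sections we have

$$\begin{aligned}
S_2(P) &= \hat{s}_1 \hat{s}_2 (ct\,\hat{\mathbf{t}} + x\,\hat{\mathbf{x}}) + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}} \\
 &= \gamma\,\hat{\mathbf{t}}(\hat{\mathbf{t}} + \beta\hat{\mathbf{x}})(ct\,\hat{\mathbf{t}} + x\,\hat{\mathbf{x}}) + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}} \\
 &= \gamma(ct\,\hat{\mathbf{t}} - \beta ct\,\hat{\mathbf{x}} + x\,\hat{\mathbf{x}} - x\beta\,\hat{\mathbf{t}}) + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}} \\
 &= \gamma c\,(t - vx/c^2)\,\hat{\mathbf{t}} + \gamma(x - vt)\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}
\end{aligned} \qquad (139)$$

Consequently

$$\begin{aligned}
t' &= \gamma(t - vx/c^2) \\
x' &= \gamma(x - vt) \\
y' &= y \\
z' &= z
\end{aligned} \qquad (140)$$

These are the standard Lorentz transformations.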
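
The step from the product of unit vectors to the Lorentz transformations can be replayed numerically. The sketch below evaluates $\gamma\,\hat{\mathbf{t}}(\hat{\mathbf{t}} + \beta\hat{\mathbf{x}})(ct\,\hat{\mathbf{t}} + x\,\hat{\mathbf{x}})$ with a generic geometric product and compares the coefficients with the closed formulas for t' and x'. It is an illustration under assumptions: the bitmask encoding, the function names, and the sample values β = 3/5, γ = 5/4 are choices made here, with rational arithmetic used so that the comparison is exact.

```python
from fractions import Fraction as F
from itertools import product

def blade_product(a, b, sig):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:  # parity of the swaps needed to bring generators into order
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i, s in enumerate(sig):  # a repeated generator contributes its square
        if ((a & b) >> i) & 1:
            sign *= s
    return sign, a ^ b

def gp(A, B, sig):
    """Geometric product of multivectors stored as {blade bitmask: coefficient}."""
    out = {}
    for (a, x), (b, y) in product(A.items(), B.items()):
        s, c = blade_product(a, b, sig)
        out[c] = out.get(c, 0) + s * x * y
    return {k: v for k, v in out.items() if v != 0}

SIG = (1, -1, -1, -1)  # t, x, y, z on bits 0, 1, 2, 3
C = F(299_792_458)     # speed of light in m/s

def boost_via_algebra(t, x, beta, gamma):
    """Coefficients (t', x') read off from gamma t (t + beta x)(ct t + x x)."""
    s1 = {1: F(1)}                       # unit vector t
    s2 = {1: gamma, 2: gamma * beta}     # unit vector gamma (t + beta x)
    planar = {1: C * t, 2: x}            # tx-plane part of the event
    out = gp(gp(s1, s2, SIG), planar, SIG)
    return out.get(1, 0) / C, out.get(2, 0)

def boost_standard(t, x, beta, gamma):
    """Closed formulas for t' and x' from the Lorentz transformations."""
    v = beta * C
    return gamma * (t - v * x / C**2), gamma * (x - v * t)

args = (F(1, 1000), F(5), F(3, 5), F(5, 4))
algebra, standard = boost_via_algebra(*args), boost_standard(*args)
print(algebra == standard)
```

The two routes agree exactly for the sample event, and the product contains no terms other than the $\hat{\mathbf{t}}$ and $\hat{\mathbf{x}}$ components, consistent with a rotation confined to the tx plane.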

For simplicity we assumed that the origins of $S_1$ and $S_2$ coincided at t = 0 and that the spatial
orientations of the frames are equal. For the case where the orientations are not the same, the above
calculations still hold but one has to introduce a rotation (see section [4]) to align the frames. For the
case where the frames do not share a common origin (both in space and time), the origins of the two
frames will be connected via a translation in spacetime

$$\mathrm{Trans}(S_2 \to S_1)\,O_2 = O_2 + T = O_1 \qquad (141)$$

where $T = t_a\hat{\mathbf{t}} + x_a\hat{\mathbf{x}} + y_a\hat{\mathbf{y}} + z_a\hat{\mathbf{z}}$ is a vector. So if at t = 0 (in frame $S_2$), the location of the origin of
frame $S_1$ as measured by an observer in frame $S_2$ is given by

$$S_2(O_1)\big|_{t=0} = x_a\hat{\mathbf{x}} + y_a\hat{\mathbf{y}} + z_a\hat{\mathbf{z}} \qquad (142)$$

then the transformation between the two frames is given by

$$t' = t_a + \gamma(t - vx/c^2) \qquad (143)$$

$$x' = x_a + \gamma(x - vt) \qquad (144)$$

$$y' = y_a + y \qquad (145)$$

$$z' = z_a + z \qquad (146)$$

These are the Poincaré transformations for parallel spatial axes.
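
A short sketch of the combined transformation follows. It simply composes the boost along x with a constant spacetime offset, as in the equations above; the function name `poincare`, the tuple layout (t, x, y, z) and the sample numbers are assumptions made for the illustration.

```python
from fractions import Fraction as F

C = F(299_792_458)  # speed of light in m/s

def poincare(event, beta, gamma, offset=(0, 0, 0, 0)):
    """Boost along x followed by a constant offset, for parallel spatial axes."""
    t, x, y, z = event
    ta, xa, ya, za = offset
    v = beta * C
    return (ta + gamma * (t - v * x / C**2),
            xa + gamma * (x - v * t),
            ya + y,
            za + z)

beta, gamma = F(3, 5), F(5, 4)
offset = (F(1), F(2), F(3), F(4))

origin_image = poincare((0, 0, 0, 0), beta, gamma, offset)
no_offset = poincare((F(1, 1000), F(5), F(6), F(7)), beta, gamma)
print(origin_image, no_offset)
```

The origin event is carried to the offset itself, and with a zero offset the function reduces to the Lorentz transformations, with y and z unchanged.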

5.6 Concluding remarks - What have we achieved? 

In this section we have extended the work of the previous three sections to include time. The invariance 
properties of clocks correspond strongly with the homogeneity of space. We associated a scale to the
time parameter by developing a time measuring stick OT (known as a 'clock') and the unit vector t. 
We may treat distances in the time direction (time intervals) in similar ways that we treat distance 
in any one of the three linearly independent space directions. An event a in spacetime can then be 
labelled in terms of the four basis vectors (t, x, y, z). The vector space properties of the time dimension 
have been shown to be analogous to those properties derived from the homogeneity of each of the three 
dimensions of physical space. Time can be treated as a fourth coordinate, that is physical spacetime 
corresponds to a four dimensional vector space. The time coordinate is a 'coordinate', 'parameter' or 
'direction' that is linearly independent of the three space 'coordinates', 'parameters' or 'directions'. 

The relationship of the time axis to the space axes is given by the Lorentz metric, just as the 
relationship of the space axes to each other is given by the Pythagorean metric. The Lorentz metric 
follows from the invariance of the speed of light, just as the Pythagorean metric follows from the 
isotropy of space, being the invariance of rigid bodies under rotation. 

It is a guiding general principle of science that observations are essentially independent of the 
observer. Einstein's 1905 paper [31] uses a more specific form of this general principle, "That physics
is the same for all inertial observers" (translation as used by [3]). In this section we have assumed
that all inertial observers measure the same speed of light, c. All of this is well known and standard. Our
argument contains some novelty in the following results. 

If the Pythagorean metric is assumed to apply to one rigid body frame, then the assumption of
homogeneity means it applies to all rigid body frames. Likewise, we need only assume that the speed of
light c is invariant in one inertial rigid body frame, for we can use the homogeneity of spacetime and
the isotropy of space to deduce that the Clifford algebra $C\ell(1,3)$ gives the transformation laws for all
spacetime measurements between all inertial rigid body frames.

We have extended the Clifford algebra $C\ell(0,3)$ to incorporate our knowledge of the relationship
of the time parameter to the three spatial parameters. The concept of 'distance' in spacetime arising
from the constancy of the speed of light enabled us to expand the four-dimensional vector space to
an associative algebra of dimension $2^4 = 16$. We found the algebraic product relationship between the
basis vectors by repeating the argument from section 3. The rules of eqs. (2-6) and eq. (125)
define the Clifford algebra $C\ell(1,3)$.

We explored this algebra in subsection 5.5 and employed the techniques introduced in section 4 for
transforming measurements of events (and by extension the notion of rigid bodies) between inertial
reference frames in a relativistic setting to obtain the Lorentz and Poincare transformations.

This paper considers only inertial rigid body frames. Frames are allowed to move at a constant
speed, and two frames are allowed to be rotated with respect to each other (that is, to have
different orientations), but we do not allow a frame to be accelerating or rotating, since such a frame
would no longer be an inertial frame. Work is needed to use the Clifford algebra to account for any accelerations
that are being experienced by observers. This will allow us to deduce the extension of the properties of
rigid bodies and clocks to accelerated motion. We further anticipate that this future work will resolve
many of the paradoxes, such as the twin paradox, and various open problems (see for example [28]) in
special relativity.

6 Algebraic Structure of $C\ell(1,3)$

We have argued that the Clifford algebra $C\ell(1,3)$ is the appropriate algebra to describe spacetime. We
have shown that the rational numbers $\mathbb{Q}$ are needed as the field over which the algebra is defined. In
this section we explore further the algebraic structure of spacetime.

Matrices are a natural and very useful way to study the properties of algebras. In this section we
review the matrix representations of the Clifford algebras $C\ell(1,3)$ and $C\ell(3,1)$ and some of their lower
dimensional subalgebras. For representations of $C\ell(p,q)$ up to $p + q = 7$ and for some general results,
the reader is referred to Lounesto [34].

Although matrix representations are a useful tool for studying the Clifford algebras of space and
spacetime, and indeed algebras in general, it must be remembered that the Clifford algebras retain a
stronger link to the points, lines etc. of spacetime than matrix representations, which are only
determined up to a similarity transformation.

There are two important points we must consider when looking for a matrix representation. First 
the dimension of the algebra is normally no more than the number of independent components of the 
matrices. Second, it is important to find connections between the geometry of the Clifford algebras and 
the geometry which is present in their matrix representations. This is a powerful incentive to consider 
mainly matrix representations over the rational or real numbers. 

One claimed weakness [3] of the Clifford algebra $C\ell(1,3)$ in being able to mathematically describe
physical reality is that the algebra is not a division algebra, meaning that there are elements of the
algebra other than zero for which no inverse can be found. There are in fact very few linear spaces which
admit the structure of a division algebra; the algebra of the rationals, the reals, the complex numbers
and the quaternion algebra are examples. The Clifford algebra $C\ell(1,3)$ is not a division algebra as
there exist many elements $A$ for which no inverse $A^{-1}$ can be defined.

It has been shown by van der Mark and Williamson [35] that the areas of the algebra where the
inverse does not exist, that is where division cannot be defined, are where certain invariant quantities
become zero, for example on the light cone. These areas are referred to as null-hyperplanes because
they correspond to null multivectors, and they correspond exactly to cases of physical interest. The fact that
there does not exist an inverse for every element is therefore not a weakness but a necessity, because
the breakdown of invertibility in these areas matches the behavior of nature.

Later in this section, we confirm some of the results found in [35], not by means of defining
a new conjugate, but by using the matrix representations of the spacetime Clifford algebra $C\ell(1,3)$.
The use of matrix representations makes it a straightforward task to determine which elements of the
algebra are or are not invertible. Given that an element is invertible, it is then straightforward to
calculate its inverse. The invertibility or non-invertibility of multivectors gives us physical insight into
conserved quantities and limitations of physical systems.

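6.1 $C\ell(1,0)$ and $C\ell(0,1)$

The Clifford algebras $C\ell(1,0)$ and $C\ell(0,1)$ each have two basis elements, $1$ and $e_1$, satisfying

$$ 1^2 = 1, \quad e_1^2 = +1 \tag{147} $$

for $C\ell(1,0)$ and

$$ 1^2 = 1, \quad e_1^2 = -1 \tag{148} $$

for $C\ell(0,1)$. Both algebras have two degrees of freedom and can be represented as $2 \times 2$ real matrices.
A representation of $C\ell(1,0)$ requires two $2 \times 2$ matrices that square to unity. A suitable basis for
this algebra is

$$ 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad e_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \tag{149} $$

meaning a general element $A \in C\ell(1,0)$ may be written as

$$ A = a + b e_1 = \begin{pmatrix} a & b \\ b & a \end{pmatrix} \tag{150} $$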

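For $C\ell(0,1)$, $e_1^2 = -1$ and so this algebra is isomorphic to the algebra of complex numbers $\mathbb{C}$

$$ \mathbb{C} \cong C\ell(0,1) \tag{151} $$

An arbitrary element $A \in C\ell(0,1)$ (or equivalently an arbitrary complex number) may be written as
a linear combination of the $C\ell(0,1)$ elements $1$ and $e_1$, or equivalently as a linear combination of the
$2 \times 2$ unimodular basis matrices

$$ 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad e_1 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{152} $$

Thus

$$ A = a + b e_1 = \begin{pmatrix} a & b \\ -b & a \end{pmatrix}, \quad a, b \in \mathbb{R} \tag{153} $$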

The geometry of these two Clifford algebras is different from the standard Argand diagram view
of complex numbers where a complex number z is a point in a two dimensional plane. In the complex
number algebra, z can be rotated in the complex plane. In the one dimensional geometries described
by $C\ell(1,0)$ and $C\ell(0,1)$ however, there is no physical rotation operator because space is simply not
big enough. Given a vector in a one dimensional space, and a corresponding set of lines in a physical
space, there is no physical operation that will transform the lines into minus themselves (that is,
an inversion) even though such a mathematical operator exists (multiply by $-1$). More generally we
say that in an n-dimensional space, an n-vector may have a mathematical inversion, but there is no
geometric operation that will turn the corresponding geometric object, an n-multivector, into minus
itself. We discussed this issue in sections 2 and 5.

As a final observation, notice that

$$ \det(A) = a^2 + b^2 \quad \text{if } A \in C\ell(0,1) \tag{154} $$

$$ \det(A) = a^2 - b^2 \quad \text{if } A \in C\ell(1,0) \tag{155} $$

In $C\ell(0,1)$ only the trivial element with $a = b = 0$ does not have an inverse. In $C\ell(1,0)$ there are many
elements for which an inverse is not defined (whenever $a = \pm b$).
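
The two determinants can be checked with a few lines of NumPy. This is only a sketch: the names `E1_PLUS`, `E1_MINUS`, `element` and `is_invertible` are hypothetical, and the matrices are those implied by the general elements (150) and (153).

```python
import numpy as np

I2 = np.eye(2)
E1_PLUS = np.array([[0.0, 1.0], [1.0, 0.0]])    # e1 with e1^2 = +1, Cl(1,0)
E1_MINUS = np.array([[0.0, 1.0], [-1.0, 0.0]])  # e1 with e1^2 = -1, Cl(0,1)


def element(a, b, e1):
    """Matrix of the general element a + b*e1."""
    return a * I2 + b * e1


def is_invertible(a, b, e1, tol=1e-12):
    """True when the determinant of a + b*e1 is non-zero (within tol)."""
    return abs(np.linalg.det(element(a, b, e1))) > tol
```

With these definitions `is_invertible(2, 2, E1_PLUS)` is false, while the same coefficients with `E1_MINUS` give an invertible element.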

6.2 $C\ell(2,0)$ and $C\ell(0,2)$

We require a set of four linearly independent matrices that satisfy the commutation relations of the
basis elements $\{1, e_1, e_2, e_{12}\}$ of the algebras $C\ell(0,2)$ and $C\ell(2,0)$. We encountered these algebras in
section 3, where it was shown that the homogeneity and isotropy of 2D space, together with Pythagoras'
theorem, gives one of these two algebras depending on the choice of metric.

We begin by considering the algebra of all $2 \times 2$ matrices with real entries, $\mathrm{Mat}(2,\mathbb{R})$. One useful
basis for this algebra is $I_2, \ell, m, n$ with

$$ I_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad \ell = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad m = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad n = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{156} $$

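These basis elements satisfy

$$ I_2^2 = \ell^2 = m^2 = 1, \quad \text{and} \quad n^2 = -1 \tag{157} $$

An arbitrary $2 \times 2$ matrix can be written in this basis as

$$ \begin{pmatrix} a & b \\ c & d \end{pmatrix} = \frac{a+d}{2} \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + \frac{a-d}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} + \frac{b+c}{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} + \frac{b-c}{2} \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{158} $$

$I_2, \ell, m$ and $n$ satisfy the multiplication rules

$$ \ell m = -m\ell = n, \quad nm = -mn = \ell, \quad \text{and} \quad \ell n = -n\ell = m \tag{159} $$

which are precisely the rules satisfied by the basis elements $e_1, e_2$ and $e_{12}$ of $C\ell(2,0)$. An arbitrary
multivector $A$ in this algebra may therefore be represented as

$$ A = a1 + b e_1 + c e_2 + d e_{12} = \begin{pmatrix} a + b & c + d \\ c - d & a - b \end{pmatrix} \tag{160} $$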
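
A minimal sketch of this basis in NumPy follows. The names `L`, `M`, `N`, `decompose` and `compose` are hypothetical, and the matrices are taken from the decomposition (158). The sketch reads off the coefficients of an arbitrary real $2 \times 2$ matrix and rebuilds the matrix of (160).

```python
import numpy as np

I2 = np.eye(2)
L = np.array([[1.0, 0.0], [0.0, -1.0]])   # the matrix written l in the text
M = np.array([[0.0, 1.0], [1.0, 0.0]])
N = np.array([[0.0, 1.0], [-1.0, 0.0]])


def decompose(mat):
    """Coefficients of a 2x2 real matrix in the basis (I2, l, m, n), eq. (158)."""
    (a, b), (c, d) = mat
    return (a + d) / 2, (a - d) / 2, (b + c) / 2, (b - c) / 2


def compose(coeffs):
    """Matrix p*I2 + q*l + r*m + s*n, as in eq. (160) with e1=l, e2=m, e12=n."""
    p, q, r, s = coeffs
    return p * I2 + q * L + r * M + s * N
```

Because `compose(decompose(X))` returns `X` for every real $2 \times 2$ matrix `X`, the four matrices span $\mathrm{Mat}(2,\mathbb{R})$.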

A matrix representation of $C\ell(0,2)$ in terms of $\mathrm{Mat}(2,\mathbb{R})$ cannot be found because $\mathrm{Mat}(2,\mathbb{R})$ has
only one of its basis elements square to minus unity whereas $C\ell(0,2)$ has three. A set of three matrices
that square to minus unity is needed. It is easily proved that $3 \times 3$ real matrices are also too small.

A representation can be found in terms of the $2 \times 2$ matrices with complex entries or a representation
in terms of $4 \times 4$ matrices with real entries. A suitable set of sixteen $4 \times 4$ matrices is constructed by
taking tensor products of the $2 \times 2$ basis elements $I_2, \ell, m, n$.

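$$ \begin{aligned}
A_1 &= I_2 \otimes I_2 = \begin{pmatrix} I_2 & 0 \\ 0 & I_2 \end{pmatrix}, & A_2 &= I_2 \otimes \ell = \begin{pmatrix} I_2 & 0 \\ 0 & -I_2 \end{pmatrix}, \\
A_3 &= I_2 \otimes m = \begin{pmatrix} 0 & I_2 \\ I_2 & 0 \end{pmatrix}, & A_4 &= I_2 \otimes n = \begin{pmatrix} 0 & I_2 \\ -I_2 & 0 \end{pmatrix}, \\
A_5 &= \ell \otimes I_2 = \begin{pmatrix} \ell & 0 \\ 0 & \ell \end{pmatrix}, & A_6 &= \ell \otimes \ell = \begin{pmatrix} \ell & 0 \\ 0 & -\ell \end{pmatrix}, \\
A_7 &= \ell \otimes m = \begin{pmatrix} 0 & \ell \\ \ell & 0 \end{pmatrix}, & A_8 &= \ell \otimes n = \begin{pmatrix} 0 & \ell \\ -\ell & 0 \end{pmatrix}, \\
A_9 &= m \otimes I_2 = \begin{pmatrix} m & 0 \\ 0 & m \end{pmatrix}, & A_{10} &= m \otimes \ell = \begin{pmatrix} m & 0 \\ 0 & -m \end{pmatrix}, \\
A_{11} &= m \otimes m = \begin{pmatrix} 0 & m \\ m & 0 \end{pmatrix}, & A_{12} &= m \otimes n = \begin{pmatrix} 0 & m \\ -m & 0 \end{pmatrix}, \\
A_{13} &= n \otimes I_2 = \begin{pmatrix} n & 0 \\ 0 & n \end{pmatrix}, & A_{14} &= n \otimes \ell = \begin{pmatrix} n & 0 \\ 0 & -n \end{pmatrix}, \\
A_{15} &= n \otimes m = \begin{pmatrix} 0 & n \\ n & 0 \end{pmatrix}, & A_{16} &= n \otimes n = \begin{pmatrix} 0 & n \\ -n & 0 \end{pmatrix}
\end{aligned} \tag{161} $$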



These matrices satisfy

$$ A_i^2 = +1 \ \text{ for } i = 1, 2, 3, 5, 6, 7, 9, 10, 11, 16; \qquad A_i^2 = -1 \ \text{ for } i = 4, 8, 12, 13, 14, 15 $$
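
The construction of the sixteen matrices and the list of squares can be reproduced as follows. This is a sketch under stated assumptions: the helper `tensor` and the dictionary `A` are hypothetical names, and the block convention (first factor inside the blocks, second factor giving the block pattern) is an assumption about the layout of (161). The squares do not depend on that convention.

```python
import itertools
import numpy as np

I2 = np.eye(2)
L = np.array([[1.0, 0.0], [0.0, -1.0]])
M = np.array([[0.0, 1.0], [1.0, 0.0]])
N = np.array([[0.0, 1.0], [-1.0, 0.0]])
BASIS_2X2 = [I2, L, M, N]


def tensor(x, y):
    # Assumed block convention: x sits inside the blocks and y sets the
    # block pattern, which is np.kron(y, x) in NumPy's ordering.
    return np.kron(y, x)


# A[1] .. A[16] in the order A_1 = I2 (x) I2, A_2 = I2 (x) l, A_3 = I2 (x) m, ...
A = {k + 1: tensor(x, y)
     for k, (x, y) in enumerate(itertools.product(BASIS_2X2, repeat=2))}

# Indices of the matrices squaring to +1 and to -1 respectively.
plus = [k for k, a in A.items() if np.allclose(a @ a, np.eye(4))]
minus = [k for k, a in A.items() if np.allclose(a @ a, -np.eye(4))]
```

A matrix $X \otimes Y$ squares to $X^2 \otimes Y^2$, so it squares to minus unity exactly when one of its two factors is $n$.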



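One possible representation of $C\ell(0,2)$ is to choose

$$ e_1 = A_4 = \begin{pmatrix} 0 & I_2 \\ -I_2 & 0 \end{pmatrix}, \quad e_2 = A_{15} = \begin{pmatrix} 0 & n \\ n & 0 \end{pmatrix}, \quad e_{12} = A_{14} = \begin{pmatrix} n & 0 \\ 0 & -n \end{pmatrix} \tag{162} $$

so that an arbitrary multivector $A$ in this algebra may be represented as

$$ A = a1 + b e_1 + c e_2 + d e_{12} = \begin{pmatrix} a I_2 + d n & b I_2 + c n \\ -b I_2 + c n & a I_2 - d n \end{pmatrix} \tag{163} $$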



This choice of representation is however not unique. 

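These matrices also give a representation of the quaternion algebra $\mathbb{H}$, and so the quaternion algebra
is isomorphic to $C\ell(0,2)$

$$ C\ell(0,2) \cong \mathbb{H} \tag{164} $$

This $4 \times 4$ real representation of the quaternions $(i, j, k)$ is given by

$$ i = \begin{pmatrix} 0 & I_2 \\ -I_2 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ -1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}, \quad
j = \begin{pmatrix} 0 & n \\ n & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \end{pmatrix}, \quad
k = \begin{pmatrix} n & 0 \\ 0 & -n \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 0 \end{pmatrix} \tag{165} $$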
A $2 \times 2$ complex representation of the quaternions $i, j, k$ is given by

where the matrix element $i = \sqrt{-1}$. We apologize for two different uses for $i$ in the same equation!
These different matrix representations demonstrate that we have the embedding

$$ \mathrm{Mat}(1,\mathbb{H}) \subset \mathrm{Mat}(2,\mathbb{C}) \subset \mathrm{Mat}(4,\mathbb{R}) \tag{167} $$

More generally $\mathrm{Mat}(n,\mathbb{H}) \subset \mathrm{Mat}(2n,\mathbb{C}) \subset \mathrm{Mat}(4n,\mathbb{R})$ for integer $n$.

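If we wish to, we can of course also represent $C\ell(2,0)$ using $4 \times 4$ matrices. For example, a possible
representation of $C\ell(2,0)$ in $\mathrm{Mat}(4,\mathbb{R})$ is

$$ e_1 = \begin{pmatrix} \ell & 0 \\ 0 & \ell \end{pmatrix}, \quad e_2 = \begin{pmatrix} m & 0 \\ 0 & m \end{pmatrix}, \quad e_{12} = \begin{pmatrix} n & 0 \\ 0 & n \end{pmatrix} \tag{168} $$

however this representation is just two copies of its representation in $\mathrm{Mat}(2,\mathbb{R})$.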

6.3 $C\ell(3,0)$ and $C\ell(0,3)$

The algebras $C\ell(0,3)$ and $C\ell(3,0)$ are both eight dimensional with basis $\{1, e_1, e_2, e_3, e_{23}, e_{31}, e_{12}, e_{123}\}$.
In section 3 we showed that both these algebras contain some cyclic structure. Both algebras have
a four dimensional subalgebra called the even subalgebra, spanned by the scalar and the three bivectors
$\{1, e_{23}, e_{31}, e_{12}\}$. These two even subalgebras are isomorphic to each other and also isomorphic to the
quaternion algebra with the isomorphism

$$ 1 \leftrightarrow 1, \quad i \leftrightarrow -e_{23}, \quad j \leftrightarrow -e_{31}, \quad k \leftrightarrow -e_{12} \quad \text{for } C\ell^+(3,0) \tag{169} $$

$$ 1 \leftrightarrow 1, \quad i \leftrightarrow e_{23}, \quad j \leftrightarrow e_{31}, \quad k \leftrightarrow e_{12} \quad \text{for } C\ell^+(0,3) \tag{170} $$

We note as we did in section 4 that $C\ell(3,0)$ does not respect the cyclic structure of $e_1 e_2 = e_{12}$ that
we have for $C\ell(0,3)$.

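$C\ell(3,0)$ has four basis elements that square to unity and four that square to minus unity. A
representation of the algebra can be found in terms of $4 \times 4$ real matrices

$$ 1 = A_1 = \begin{pmatrix} I_2 & 0 \\ 0 & I_2 \end{pmatrix}, \quad e_1 = A_5 = \begin{pmatrix} \ell & 0 \\ 0 & \ell \end{pmatrix}, \quad e_2 = A_{10} = \begin{pmatrix} m & 0 \\ 0 & -m \end{pmatrix}, \quad e_3 = A_{11} = \begin{pmatrix} 0 & m \\ m & 0 \end{pmatrix} \tag{171} $$

From these the matrix representations of the other basis elements are readily found to be

$$ e_{12} = A_{14}, \quad e_{23} = A_4, \quad e_{31} = -A_{15}, \quad e_{123} = A_8 \tag{172} $$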

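It is also possible to find a $2 \times 2$ complex matrix representation by choosing

$$ e_1 = \sigma_1, \quad e_2 = \sigma_2, \quad e_3 = \sigma_3 \tag{173} $$

where

$$ 1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad \sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{174} $$

are the Pauli matrices that give a matrix representation of the Pauli algebra

$$ [\sigma_a, \sigma_b] = 2i \epsilon_{abc} \sigma_c \tag{175} $$

From these it is then easy to show that

$$ \begin{aligned}
e_{12} &= \sigma_1 \sigma_2 = i\sigma_3 \\
e_{23} &= \sigma_2 \sigma_3 = i\sigma_1 \\
e_{31} &= \sigma_3 \sigma_1 = i\sigma_2 \\
e_{123} &= \sigma_1 \sigma_2 \sigma_3 = i
\end{aligned} \tag{176} $$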
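
The products in (176) are quick to confirm numerically. The sketch below uses the standard Pauli matrices; the variable names are chosen for this illustration only.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
S1 = np.array([[0, 1], [1, 0]], dtype=complex)
S2 = np.array([[0, -1j], [1j, 0]])
S3 = np.array([[1, 0], [0, -1]], dtype=complex)

# Vectors of Cl(3,0) in the Pauli representation, eq. (173).
e1, e2, e3 = S1, S2, S3

# Bivectors and the trivector, built as matrix products.
e12, e23, e31 = e1 @ e2, e2 @ e3, e3 @ e1
e123 = e1 @ e2 @ e3
```

In this representation the trivector `e123` is the unit imaginary times the identity, so the four elements squaring to minus unity are exactly the bivectors and the trivector.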
There are no representations of $C\ell(0,3)$ in terms of $4 \times 4$ real matrices or $2 \times 2$ complex matrices
because this algebra has six of its eight basis elements square to minus unity. It is possible to find a
representation in terms of $8 \times 8$ real matrices, $4 \times 4$ complex matrices and also in terms of the $2 \times 2$
matrix algebra over the quaternions. This quaternion representation is given by



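$$ 1 = I_2, \quad e_1 = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}, \quad e_2 = \begin{pmatrix} 0 & j \\ j & 0 \end{pmatrix}, \quad e_3 = \begin{pmatrix} 0 & k \\ k & 0 \end{pmatrix} \tag{177} $$

$$ e_{23} = \begin{pmatrix} i & 0 \\ 0 & i \end{pmatrix}, \quad e_{31} = \begin{pmatrix} j & 0 \\ 0 & j \end{pmatrix}, \quad e_{12} = \begin{pmatrix} k & 0 \\ 0 & k \end{pmatrix}, \quad e_{123} = \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \tag{178} $$

Using equations (165), these matrices can be rewritten as $8 \times 8$ real matrices because $i, j, k$ can be
written as $4 \times 4$ real matrices.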



6.4 $C\ell(3,1)$ and $C\ell(1,3)$

Both $C\ell(1,3)$ and $C\ell(3,1)$ are sixteen dimensional algebras that are candidates to describe the
4-dimensional geometry of spacetime. The matrix representations of these two algebras are quite distinct
as

$C\ell(1,3)$: has 6 roots of $+1$ and 10 roots of $-1$, while

$C\ell(3,1)$: has 10 roots of $+1$ and 6 roots of $-1$.

Because $C\ell(3,1)$ has only six roots of $-1$ we can find a representation in terms of $4 \times 4$ real matrices.
A suitable representation is:



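[Equation (179): a table assigning to each of the sixteen basis elements $1, e_0, e_1, e_2, e_3, e_{10}, e_{20}, e_{30}, e_{12}, e_{23}, e_{31}, e_{023}, e_{031}, e_{012}, e_{123}, e_{0123}$ one of the $4 \times 4$ block matrices $\pm A_i$ of the set (161); the individual entries are not recoverable here.]

(Here, $\ell$, $m$ and $n$ are as defined in subsection 6.2 and the set (179) is a renaming of the set (161).)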

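Because $C\ell(1,3)$ has ten roots of minus unity, the smallest possible real matrices are $8 \times 8$. The
sixteen dimensional algebra may also be represented by $2 \times 2$ matrices with quaternion entries, that is,
by $\mathrm{Mat}(2,\mathbb{H})$, as follows

$$ \begin{aligned}
1 &= \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, & e_0 &= \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \\
e_1 &= \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}, & e_2 &= \begin{pmatrix} 0 & j \\ j & 0 \end{pmatrix}, & e_3 &= \begin{pmatrix} 0 & k \\ k & 0 \end{pmatrix}, \\
e_{10} &= \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, & e_{20} &= \begin{pmatrix} 0 & -j \\ j & 0 \end{pmatrix}, & e_{30} &= \begin{pmatrix} 0 & -k \\ k & 0 \end{pmatrix}, \\
e_{23} &= \begin{pmatrix} i & 0 \\ 0 & i \end{pmatrix}, & e_{31} &= \begin{pmatrix} j & 0 \\ 0 & j \end{pmatrix}, & e_{12} &= \begin{pmatrix} k & 0 \\ 0 & k \end{pmatrix}, \\
e_{023} &= \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}, & e_{031} &= \begin{pmatrix} j & 0 \\ 0 & -j \end{pmatrix}, & e_{012} &= \begin{pmatrix} k & 0 \\ 0 & -k \end{pmatrix}, \\
e_{123} &= \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}, & e_{0123} &= \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}
\end{aligned} \tag{180} $$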



and by using equations (165), these can be rewritten as $4 \times 4$ complex, or $8 \times 8$ real matrices and we
have an embedding

$$ C\ell(1,3) \cong \mathrm{Mat}(2,\mathbb{H}) \subset \mathrm{Mat}(4,\mathbb{C}) \subset \mathrm{Mat}(8,\mathbb{R}) \tag{181} $$

Note that this representation explicitly highlights the link of the quaternions with the space-space
bi-vectors $e_{ij}$ and the Pauli spin matrices with the space-time bi-vectors $e_{0j}$. Spacetime rotations can
thus be given an acceptable treatment in any of the matrix algebras of equation (181).
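
The signature of this representation can be checked by embedding the quaternions in $2 \times 2$ complex matrices. The sketch rests on assumptions: the particular complex matrices `QI`, `QJ`, `QK` for the quaternion units, the choice $e_0 = \mathrm{diag}(1, -1)$ with each $e_i$ carrying a quaternion unit in both off-diagonal blocks, and all helper names are choices made for this illustration. It builds the sixteen basis elements as products and counts the roots of $\pm 1$.

```python
import numpy as np

# Assumed 2x2 complex matrices for the quaternion units i, j, k.
Q1 = np.eye(2, dtype=complex)
QI = np.array([[1j, 0], [0, -1j]])
QJ = np.array([[0, 1], [-1, 0]], dtype=complex)
QK = QI @ QJ
Z = np.zeros((2, 2), dtype=complex)


def blocks(q11, q12, q21, q22):
    """2x2 quaternion matrix written as a 4x4 complex matrix."""
    return np.block([[q11, q12], [q21, q22]])


e0 = blocks(Q1, Z, Z, -Q1)
e1, e2, e3 = (blocks(Z, q, q, Z) for q in (QI, QJ, QK))


def blade(*vectors):
    """Ordered matrix product of the given basis vectors."""
    out = np.eye(4, dtype=complex)
    for v in vectors:
        out = out @ v
    return out


basis = {
    "1": blade(), "e0": e0, "e1": e1, "e2": e2, "e3": e3,
    "e10": blade(e1, e0), "e20": blade(e2, e0), "e30": blade(e3, e0),
    "e23": blade(e2, e3), "e31": blade(e3, e1), "e12": blade(e1, e2),
    "e023": blade(e0, e2, e3), "e031": blade(e0, e3, e1),
    "e012": blade(e0, e1, e2),
    "e123": blade(e1, e2, e3), "e0123": blade(e0, e1, e2, e3),
}
roots_plus = sorted(k for k, m in basis.items() if np.allclose(m @ m, np.eye(4)))
roots_minus = sorted(k for k, m in basis.items() if np.allclose(m @ m, -np.eye(4)))
```

Under these assumptions `roots_minus` has ten entries, in line with the ten roots of minus unity that rule out a $4 \times 4$ real representation of $C\ell(1,3)$.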



6.5 Finding inverses in the spacetime algebra $C\ell(1,3)$

We now suggest that the easiest method of finding inverses of multivectors in $C\ell(1,3)$ is by making
use of the $2 \times 2$ quaternion matrix representation of the algebra. The advantage of this approach is
that it avoids the introduction of a new conjugate operator as in [35]. A matrix is invertible if and only
if its determinant is non-zero. We can apply this condition to the matrix representation of the algebra
$C\ell(1,3)$ and in this way find which multivectors are invertible and which are not.

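An arbitrary 16-component multivector $A \in C\ell(1,3)$ can be written in terms of the $\mathrm{Mat}(2,\mathbb{H})$
representation (180) as

$$ A = \begin{pmatrix} q_{11} & q_{12} \\ q_{21} & q_{22} \end{pmatrix} \quad \text{where the } q_{ij} \text{ are quaternions} \tag{182} $$

Writing a quaternion $q = q_1 + q_2 i + q_3 j + q_4 k$ as a $2 \times 2$ complex matrix, an easy calculation shows
that the determinant of a quaternion $q$ is given by $\det(q) = q_1^2 + q_2^2 + q_3^2 + q_4^2 = |q|^2$. The determinant
of $A$ may be expressed as (see for example the website http://en.wikipedia.org/wiki/Determinant)

$$ \begin{aligned}
\det(A) &= |q_{11}|^2 \, |q_{22} - q_{21} q_{11}^{-1} q_{12}|^2 & (q_{11} \neq 0) \\
&= |q_{22}|^2 \, |q_{11} - q_{12} q_{22}^{-1} q_{21}|^2 & (q_{22} \neq 0) \\
&= |q_{12}|^2 \, |q_{21}|^2 & (q_{11} = q_{22} = 0)
\end{aligned} \tag{183} $$

$A$ is singular if and only if $\det(A) = 0$, so that the above equations determine whether a given
multivector $A$ has an inverse or not. Provided an inverse does exist, it is straightforward to write down
a formula for the inverse of $A$. For example, when neither $q_{11}$ nor $q_{22}$ is 0,

$$ A^{-1} = \begin{pmatrix} q_{11} & q_{12} \\ q_{21} & q_{22} \end{pmatrix}^{-1} = \begin{pmatrix} w_{11} & w_{12} \\ w_{21} & w_{22} \end{pmatrix} \tag{184} $$

where

$$ \begin{aligned}
w_{11} &= (q_{11} - q_{12} q_{22}^{-1} q_{21})^{-1} \\
w_{12} &= (q_{12} q_{22}^{-1} q_{21} - q_{11})^{-1} q_{12} q_{22}^{-1} \\
w_{21} &= (q_{21} q_{11}^{-1} q_{12} - q_{22})^{-1} q_{21} q_{11}^{-1} \\
w_{22} &= (q_{22} - q_{21} q_{11}^{-1} q_{12})^{-1}
\end{aligned} \tag{185} $$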

It is thus straightforward (although perhaps tedious) to find the determinants and, when possible,
the inverses of multivectors in the algebra $C\ell(1,3)$. In the next subsection we highlight the physical
significance of a multivector not being invertible.
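
A small numerical sketch of this procedure is given below. It is a sketch under assumptions: quaternions are stored as $2 \times 2$ complex matrices (with an assumed choice of `QI`, `QJ`, `QK`), the function names are hypothetical, and the entries $w_{11}$ and $w_{12}$ are taken to be the standard Schur-complement expressions for a block inverse. The determinant is computed from the first line of (183) and can be compared with the ordinary determinant of the equivalent $4 \times 4$ complex matrix.

```python
import numpy as np

Q1 = np.eye(2, dtype=complex)
QI = np.array([[1j, 0], [0, -1j]])
QJ = np.array([[0, 1], [-1, 0]], dtype=complex)
QK = QI @ QJ


def quat(a, b, c, d):
    """The quaternion a + b i + c j + d k as a 2x2 complex matrix."""
    return a * Q1 + b * QI + c * QJ + d * QK


def norm2(q):
    """|q|^2, equal to the determinant of the 2x2 complex matrix."""
    return np.linalg.det(q).real


def det_183(q11, q12, q21, q22):
    """First line of eq. (183); assumes q11 is non-zero."""
    schur = q22 - q21 @ np.linalg.inv(q11) @ q12
    return norm2(q11) * norm2(schur)


def inverse_185(q11, q12, q21, q22):
    """Block inverse; assumes q11, q22 and both Schur complements are non-zero."""
    inv = np.linalg.inv
    w11 = inv(q11 - q12 @ inv(q22) @ q21)
    w12 = inv(q12 @ inv(q22) @ q21 - q11) @ q12 @ inv(q22)
    w21 = inv(q21 @ inv(q11) @ q12 - q22) @ q21 @ inv(q11)
    w22 = inv(q22 - q21 @ inv(q11) @ q12)
    return np.block([[w11, w12], [w21, w22]])
```

For example, with $q_{11} = 1$, $q_{12} = i$, $q_{21} = j$ and $q_{22} = 2$ the Schur complement is $2 + k$, so the determinant is $1 \cdot 5 = 5$.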

6.6 The non-invertible elements of $C\ell(1,3)$

Let us now consider some specific multivectors and find when they are singular. In particular we will
consider as explicit examples a mono-vector and a bivector. A more complete treatment of what follows
can be found in [35], where more general multivectors, including those of mixed grade, are considered.
Consider first a general mono-vector $x = (x_0, \mathbf{x}) = x_0 e_0 + x_1 e_1 + x_2 e_2 + x_3 e_3$ in $C\ell(1,3)$. In terms
of the $\mathrm{Mat}(2,\mathbb{H})$ representation

$$ x = \begin{pmatrix} x_0 & P \\ P & -x_0 \end{pmatrix} \tag{186} $$

where $P = x_1 i + x_2 j + x_3 k$ is a pure quaternion (that is, a quaternion with no real part). From the
previous subsection, an easy calculation gives the determinant of this vector as

$$ \det(x) = |x_0^2 + P^2|^2 \tag{187} $$

and so $x$ fails to have an inverse if and only if

$$ x^2 = x_0^2 - x_1^2 - x_2^2 - x_3^2 = 0 \tag{188} $$

For the case where $x$ is a position vector in spacetime, $x^2$ is just the invariant interval. From
relativity we know that this interval being zero corresponds to $x$ being on the lightcone. Therefore
the hyperplane where division is not defined for mono-vectors is precisely in agreement with the physical
limitations set in place by the speed of light.
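
The lightcone condition can be seen directly in a numerical sketch. The block form $(x_0, P; P, -x_0)$ is the one described above; the complex matrices chosen for the quaternion units and the names `monovector` and `interval2` are assumptions of this illustration.

```python
import numpy as np

# Assumed 2x2 complex matrices for the quaternion units i, j, k.
QI = np.array([[1j, 0], [0, -1j]])
QJ = np.array([[0, 1], [-1, 0]], dtype=complex)
QK = QI @ QJ


def monovector(x0, x1, x2, x3):
    """x0 e0 + x1 e1 + x2 e2 + x3 e3 as the block matrix (x0, P; P, -x0)."""
    p = x1 * QI + x2 * QJ + x3 * QK          # pure quaternion P
    return np.block([[x0 * np.eye(2), p], [p, -x0 * np.eye(2)]])


def interval2(x0, x1, x2, x3):
    """The invariant x0^2 - x1^2 - x2^2 - x3^2."""
    return x0**2 - x1**2 - x2**2 - x3**2
```

In this form the matrix square of `monovector(...)` is `interval2(...)` times the identity, so a vector with non-zero interval has the inverse `x / interval2(...)`, and a vector on the lightcone has none.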

As another example of a mono-vector, consider the differential operator $\partial$,

$$ \partial = e_0 \partial_0 - e_1 \partial_1 - e_2 \partial_2 - e_3 \partial_3 \tag{189} $$

This operator is singular if

$$ \partial_0^2 - \nabla^2 = 0 \tag{190} $$

Similarly, the vector potential $A = (\phi, \mathbf{A}) = \phi e_0 + A_1 e_1 + A_2 e_2 + A_3 e_3$ does not have an inverse
when

$$ \phi^2 = |\mathbf{A}|^2 \tag{191} $$

Via a Lorentz transformation it is always possible to find a frame where $A_2 = A_3 = 0$, in which case
$|\mathbf{A}|^2 = |A_1|^2$. In this frame, the potential $A$ does not have an inverse if $\phi = \pm|A_1|$.
Next, consider a bi-vector $F \in C\ell(1,3)$, written as

$$ F = \begin{pmatrix} P_1 & -P_2 \\ P_2 & P_1 \end{pmatrix} \tag{192} $$

where $P_1$ and $P_2$ are both pure quaternions. For a pure quaternion $P$, $P^2 = -|P|^2$ and therefore the
inverse of $P$ is given by

$$ P^{-1} = -\frac{P}{|P|^2} \tag{193} $$

Therefore $F$ does not have an inverse when

$$ P_1 = -P_2 P_1^{-1} P_2 \tag{194} $$

Note that

$$ |P_2 P_1^{-1} P_2| = \frac{|P_2|^2}{|P_1|} $$

so that this condition implies that $|P_1| = |P_2|$.

The two pure quaternions can therefore be written as

$$ P_1 = |P_1| \hat{P}_1, \quad \hat{P}_1^2 = -1; \qquad P_2 = |P_2| \hat{P}_2, \quad \hat{P}_2^2 = -1 \tag{195} $$

Equation (194) now implies that

$$ \hat{P}_1 \hat{P}_2 = -\hat{P}_2 \hat{P}_1 \tag{196} $$

Thus by regarding $P_1$ and $P_2$ as vectors in 3-space, we have that $P_1 \perp P_2$. In other words, $F$ being singular
is equivalent to the conditions

$$ |P_1| = |P_2|, \quad \text{and} \quad P_1 \perp P_2 \tag{197} $$

The electromagnetic field can be written as a bi-vector in $C\ell(1,3)$. Explicitly,

$$ F = E_1 e_{01} + E_2 e_{02} + E_3 e_{03} + B_1 e_{23} + B_2 e_{31} + B_3 e_{12} \tag{198} $$

where $E_i$ and $B_i$ are the electric and magnetic field components respectively [36].

The reader is reminded that the space-space bi-vectors $e_{ij}$ are isomorphic to the pure quaternions.
We substitute

$$ P_1 = \mathbf{B}, \quad P_2 = -\epsilon \mathbf{E} \tag{199} $$

where $\mathbf{E} = (E_1, E_2, E_3)$ and $\mathbf{B} = (B_1, B_2, B_3)$ are the electric and magnetic Heaviside-Gibbs field
vectors and $\epsilon = e_{0123}$ is the pseudoscalar. The lack of an inverse then implies that

$$ |\mathbf{E}| = |\mathbf{B}|, \quad \text{and} \quad \mathbf{E} \perp \mathbf{B} \tag{200} $$

that is, $F$ is the bivector that corresponds to free electromagnetic waves.
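
The singularity condition can be illustrated numerically. Everything below is a sketch under assumptions: $e_0$ is taken as $\mathrm{diag}(1, -1)$ and each $e_i$ carries a quaternion unit in both off-diagonal blocks, which gives the field bivector the block form $(B, E; -E, B)$. The quaternion units are stored as assumed $2 \times 2$ complex matrices, and the closed form in `det_closed_form` is derived for this sketch rather than quoted from the text.

```python
import numpy as np

# Assumed 2x2 complex matrices for the quaternion units i, j, k.
QI = np.array([[1j, 0], [0, -1j]])
QJ = np.array([[0, 1], [-1, 0]], dtype=complex)
QK = QI @ QJ


def pure(v):
    """Pure quaternion v[0] i + v[1] j + v[2] k as a 2x2 complex matrix."""
    return v[0] * QI + v[1] * QJ + v[2] * QK


def field_bivector(E, B):
    """Assumed block form of F = E_i e_0i + B_1 e_23 + B_2 e_31 + B_3 e_12."""
    e, b = pure(E), pure(B)
    return np.block([[b, e], [-e, b]])


def det_closed_form(E, B):
    """(|B|^2 - |E|^2)^2 + 4 (E.B)^2, which vanishes only for |E|=|B|, E perp B."""
    E, B = np.asarray(E, dtype=float), np.asarray(B, dtype=float)
    return (B @ B - E @ E) ** 2 + 4 * (E @ B) ** 2
```

A field with $\mathbf{E} = (1, 0, 0)$ and $\mathbf{B} = (0, 1, 0)$ gives a singular matrix, while rescaling $\mathbf{B}$ or tilting it towards $\mathbf{E}$ restores invertibility.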

6.7 Concluding remarks - What have we achieved? 

Matrices are a natural and useful way of studying various properties of algebras. One down side of 
working with matrices is that the matrix representations are not so clearly tied to the geometry. 

Although most of this section has used the reals, the complex numbers and the quaternions, we 
observe that we need only the rational number field for all the calculations. Because the Clifford 
algebras over the rationals include elements that square to minus unity, we do not need the complex 
number field. 

The representation of $C\ell(1,3)$ in terms of $\mathrm{Mat}(2,\mathbb{H})$ highlights the link between the quaternions
and the bi-vectors $e_{ij}$ and between the Pauli spin matrices and the bi-vectors $e_{0j}$. Rotations can be given an
acceptable treatment in any of the appropriate algebras. However, as was shown in sections 4 and 5,
only the algebras $C\ell(0,3)$ and $C\ell(1,3)$ preserve cyclic structure.

The spacetime Clifford algebra $C\ell(1,3)$ is not a division algebra. This has led the author of reference
[3], and others, to suggest that the algebra is therefore not a suitable mathematical structure to model
physical reality. The existence of inverses is indeed very important.

We have confirmed the observations made in [35] that the areas of the algebra where division is not
defined correspond to the situations on the lightcone. Therefore, the behaviour of the algebra matches
the behaviour of the physical universe. We conclude therefore that the lack of division throughout the
entire algebra is not to be regarded as a weakness of the algebra but as a necessity, since it matches the
behaviour of our physical universe.



7 Conclusion 

In this paper we have taken up the challenge by [1,2,3] to reinvestigate the assumptions and axioms of
physics. We have reviewed the basic assumptions made about physical space, in particular its geometry. 
From these assumptions we set out to develop the most appropriate mathematical framework within 
which to describe physical phenomena. 

In section 2 we showed that the observed translational features of rigid objects in the geometric
space of the 2D physical world lead to a set of operations on the points, lines, and areas of rigid objects,
and to a vector space. To obtain this result we only had to assume that physical space is homogeneous.
We did not have to make the usual assumption that space is continuous. Because all physical rigid
bodies are finite, and measurements of translations have both upper and lower limits $\ell_{max}$ and $\ell_{min}$,
not all the operations of the vector space have physical counterparts. No physical situation requires
the number $\infty$ to be introduced, nor do physical situations require lines whose lengths approach zero
by a Cauchy process.

Section [3] reviewed the assumptions that underlie the rotational properties of two dimensional rigid 
bodies. Making the assumption that physical space is isotropic in addition to being homogeneous 
allowed us to find operations on lines to describe rotations. We found that it is not necessary to 
introduce the unit imaginary i to describe rotations by an Argand diagram but rather that rotations 
in the plane can be better described by the geometric bivector xy. 

The concepts of homogeneity and isotropy are readily extended from two dimensions to three.
Insisting on maintaining cyclic structure for rotations uniquely led to $C\ell(0,3)$ as the appropriate
algebra to describe 3D geometry. This algebra contains three bivectors xy, yz, zx that take the place
of the usual ix, iy, iz to describe rotations in the three orthogonal planes. Again no continuity conditions
need to be imposed to arrive at these conclusions.

In section 5 we extended our discussion of space to include time. We showed that we may use clocks
to deduce that we may regard time as a fourth dimension in a vector space over the rational numbers.
The homogeneity of time (clocks tick at the same rate today as they did yesterday), together with the
finite speed of light observed to be c in all inertial frames, led to the Lorentz metric of spacetime and
consequently the spacetime Clifford algebra $C\ell(1,3)$. Time is different from space. Unlike the three
spatial axes, the time axis cannot be rotated. Whereas a 3D rigid body may be rotated in such a way
that the orientation of a particular line within this rigid body is reversed, the orientation of the rigid
body's clock cannot be reversed. Clocks and rigid bodies are thus quite distinct. By assuming that the
equivalence principle holds for inertial rigid body frames and that the speed of light c is observed to
be the same in all such frames we derived the Lorentz and Poincare transformations.

Section 6 addressed the issue raised by Penrose [3] and others, that Clifford algebras are in general
not division algebras. We have shown that the matrix representation of $2 \times 2$ matrices over the field of
quaternions is a very powerful tool to do manipulations and find inverses in $C\ell(1,3)$. Vectors that do
not have inverses are on the lightcone. Other elements of $C\ell(1,3)$ without inverses are generalizations
of null vectors. We thereby showed that the existence of non-invertible elements in the algebra is not a
limitation of the usefulness to physics of the algebra but rather that it reflects accurately the spacetime
properties of physical systems.

We have demonstrated that a careful study of the assumptions and axioms associated with spacetime
leads to a somewhat richer structure than the standard Lorentz and Poincare algebras. We have
thus gone part of the way to answering the question raised by Smolin [4], Woit [5], and Penrose [3].